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Abstract 

Statistical pattern classification methods based on data-random graphs were introduced recently. In 
this approach, a random directed graph is constructed from the data using the relative positions of the 
data points from various classes. Different random graphs result from different definitions of the proximity 
region associated with each data point and different graph statistics can be employed for data reduction. The 
approach used in this article is based on a parameterized family of proximity maps determining an associated 
family of data-random digraphs. The relative arc density of the digraph is used as the summary statistic, 
providing an alternative to the domination number employed previously. An important advantage of the 
relative arc density is that, properly re-scaled, it is a (7-statistic, facilitating analytic study of its asymptotic 
distribution using standard [/-statistic central limit theory. The approach is illustrated with an application 
to the testing of spatial patterns of segregation and association. Knowledge of the asymptotic distribution 
allows evaluation of the Pitman and Hodges-Lehmann asymptotic efficacy, and selection of the proximity 
map parameter to optimize efficacy. Notice that the approach presented here also has the advantage of 
validity for data in any dimension. 



1 Introduction 

Classification and clustering have received considerable attention in the statistical literature. In recent years, 
a new classification approach has been developed which is based on the relative positions of the data points 
from various classes. Priebe et al. introduced the class cover catch digr aphs (CCCD) in R an d gave the exact 
and th e asymptotic distributi on of the domination number o f the CCCD (iPriebe et al. ([2001 )). DeVinnev et al 



(|2002f) . Marchctte and Priebe iMarchette and Priebd l|2003f) . IPriebe et al.l (|2003bf) . iPriebe et all (|2003a[) applied 
the concept in higher dimensions and demonstrated relatively good performance of CCCD in classification. 
The methods employed involve data reduction (condensing) by using approximate minimum dominating sets 
as prototype sets (since finding the exact minimum dominating set is an NP-hard problem — in particular for 
CCCD). Furthermore the exact and the asymptotic distribution of the domination number of the CCCD are 
not analytically tractable in multiple dimensions. 

Ceyhan and Priebe introdu ced the central similarity pr oximi ty map and r-factor proxim ity maps and the 
associated random digraphs in Ceyhan and Prieb3 ( 2003a ) and Ceyhan and Prieb3 ( 2003bl ). respectively. In 
both cases, the space is partitioned by the Delaunay tessellation which is the Delaunay triangulation in R^. 
In each triangle, a family of data-random proximity catch digraphs is constructed based on the proximity of 
the points to each other. The advantages of the r-factor proximity catch digraphs are that an exact minimum 
dominating set can be found in polynomial time and the asymptotic distribution of the domination number is 
a nalytically tractable. The la tter is then used to test segregation and association of points of different classes 
in lCevhan and Priebe (|2003dl . 



In this article, we employ a different statistic, namely the relative (arc) density, that is the proportion 
of all possible arcs (directed edges) which are present in the data random digraph. This test statistic has 



the advantage that, properly rescaled, it is a [/-statistic. Two simple classes of alternative hypotheses, for 
segregation and association, are defined in Section 12.51 The asymptotic distributions under both the null and 
the alternative hypotheses are determined in Scction[3]by using standard [/-statistic central limit theory. Pitman 
and Hodges-Lehmann asymptotic efficacy are analyzed in Sections 13.71 and 13.81 respectively. This test is related 
to the available tests of segregation and association in the literature, such as Piclou's test and Ripley's test. See 
discussion in Section 3] for more detail. Our approach is valid for data in any dimension, but for simplicity of 
expression and visualization, will be described for two-dimensional data. 



2 Preliminaries 



2.1 Proximity Maps 



Let be a measurable space and consider a function N : fl x p(fi) p{^), where p(-) represents the 

power set functional. Then given 3^ C H, the proximity map Ny{-) = N(-,y) : Q —^ associates with each 

point X € n a proximity region Ny{x) C il. Typically, N is chosen to satisfy x £ Ny(x) for all x € fl. The use 
of the adject ive proximity comes form t hinking of the region N ^jx) as representing a neighborhood of points 
"close" to X ( Jaromczvk and Toussaint ( 19921 ): ToussainT i 1980l ) ) . 



2.2 r-Factor Proximity Maps 



We now briefly define r-factor proximity maps (see Cevhan and Prieb^ ( 2003b( ) for more details). Let $7 = 
and let y = {yi,y2,y3} C be three non-coUinear points. Denote by T{y) the triangle — including the 
interior — formed by the three points. For r G [1, oo], define Ny to be the r-/actor proximity map as follows; see 
also Figure [TJ Using line segments from the center of mass of T{y) to the midpoints of its edges, we partition 
T{y) into "vertex regions" i?(yi), R{y2), and i?(y3). For x £ T{y) \ y, let v{x) G 3^ be the vertex in whose 
region x falls, so x G R{v(x)). If x falls on the boundary of two vertex regions, we assign v{x) arbitrarily to 
one of the adjacent regions. Let e{x) be the edge of T{y) opposite v{x). Let £{x) be the line parallel to e{x) 
through X. Let d{v{x), £{x)) be the Euclidean (perpendicular) distance from v{x) to £{x). For r £ [l,oo), let 
£r{x) be the fine parallel to e(x) such that d{v{x),£r{x)) — rd{v{x),£{x)) and d{£{x),£r{x)) < d{v{x),£r{x)). 
Let Tr{x) be the triangle similar to and with the same orientation as T{y) having v{x) as a vertex and £r{x) as 
the opposite edge. Then the r-factor proximity region Ny{x) is defined to be Tr{x) n T{y). Notice that r > 1 

„ N^{x) = T{y) for aU x £ T{y) \ y, so we define N^{x) = T{y) for 
{x} for all r £ [1, oo]. 



implies x £ Ny{x). Note also that lim^- 
all such X. For a; e 3^, we define Ny{x) 




Figure 1: Construction of r-factor proximity region, Ny{x) (shaded region). 



2.3 Data-Random Proximity Catch Digraphs 



If Xn ■= {Xi, X2, • • • , Xn} is a set of ri-valued random variables, then the Ny(Xi), i = 1, • • • ,n, are random 
sets. If the X^ are independent and identically distributed, then so arc the random sets Ny{Xi). 

In the case of an r-factor proximity map, notice that if Xi F and F has a non-degenerate two-dimensional 
probability density function / with support(/) C T(3^), then the special case in the construction of Ny — X 
falls on the boundary of two vertex regions — occurs with probability zero. 

The proximities of the data points to each other are used to construct a digraph. A digraph is a directed 
graph; i.e. a graph with directed edges from one vertex to another based on a binary relation. Define the 
data-random proximity catch digraph D with vertex set V = {^i, • • • , Xn\ and arc set A by {Xi, Xj) G A 
Xj G Ny{Xi). Since this relationship is not symmetric, a digraph is needed rather than a graph. The random 
digraph D depends on the (joint) distribution of the Xi and on the map Ny. 



2.4 Relative Density 

The relative arc density of a digraph D = {V,A) of order |V| = n, denoted p{D), is defined as 

1^1 



PiD) 



n{n — 1) 



where | • | denotes the set cardinality functional (jJanson et al.l ()2000f )). 

Thus p{D) represents the ratio of the number of arcs in the digraph D to the number of arcs in the complete 
symmetric digraph of order n, which is n{n — 1). For brevity of notation we use relative density rather than 
relative arc density henceforth. 

If Xi, ■ ■ ■ ,Xn *~ F the relative density of the associated data-random proximity catch digraph D, denoted 
p{Xn;h, Ny), is a ?7-statistic, 

p{Xn■,h,Ny) = —^-—Y,Y.''(X^,Xf,Ny) (1) 

nln — 1 ^ — ' — ' 

where 



h{X„Xj-Ny) = l{{X,,X,)(^A} + l{{Xj,X,)eA} 

= i{x,eNy{x,)} + i{x,eNy{x,)}. (2) 

We denote h{Xi, Xj] Ny) as hij for brevity of notation. Although the digraph is asymmetric, hij is defined 
as the nu mber of a r cs in D between vertices Xi and Xj , in order to produce a symmetric kernel with finite 
variance (|Lehmannl (|l988h ;i. 

The random variable p„ := p(A'„; h, Ny) depends on n and Ny explicitly and on F implicitly. The expecta- 
tion E[p„], however, is independent of n and depends on only F and Ny: 

< E[p„] = ^Eihri] < 1 for all n>2. (3) 
The variance Var[p„] simplifies to 

1 V — 2 

< Var[p„] = — -Var[h,2] + --Cov[/ii2, /iia] < 1/4. (4) 

2n[n — 1) n(n — 1) 

A central limit theorem for [/-statistics ( Lehmannl (|l988[) ) yields 

v/^(p„ - E[p„]) ^ AA(0, Cov[;ii2, /lis]) (5) 

provided Cov[/ii2, /113] > 0. The asymptotic variance of p„, Cov[ft,i2, /113], depends on only F and Ny. Thus, 
we need determine only E[ft,i2] and Cov[/ii2, /113] in order to obtain the normal approximation 

T approx , , , ,^ /E[/li2] Cov [ft,i2 , /113] \ , , 

Pn ^ 7v(E[/9„],Var[p„]) =7V I — — , for large n. (6) 



2.5 Null and Alternative Hypotheses 



The phenomenon known as segregation involves observations from different classes having a tendency to repel 
each other — in our case, this means the tend to be located away from all elements of 3^. Association 
involves observations from different classes havi ng a tendency t o attract one another , so that the Xi tend to be 
located near an element of y. See, for instance, DixonI ( 1994h . Coomes et al. ( 19991 ). For statistical testing for 



segregation and association, the null hypothesis is generally some form of complete spatial randomness] thus we 
consider 

Ho-.xi'^uiTiy)). 

If it is desired to have the sample size be a random variable, we may consider a spatial Poisson point process 
on T{y) as our null hypothesis. 

We define two simple classes of alternatives, and Hf with e € (O, v^/3), for segregation and association, 
respectively. For y £ y, let e(y) denote the edge of T{y) opposite vertex y, and for x G T{y) let £y{x) denote 
the line parallel to e(y) through x. Then define T(y, e) = {a; e T{y) : d{y,£y{x)) < e}. Let ijf be the model 

under which X, - U{T{y) \ UyeyT{y,e)) and Hf be the model under which Xi - U{UyeyT{y,VS/3 - e)). 
Thus the segregation model excludes the possibility of any Xi occurring near a yj , and the association model 
requires that all Xi occur near a yj. The -s/S/S — e in the definition of the association alternative is so that 
e = yields Hq under both classes of alternatives. 

Remark: These definitions of the alternatives are given for the standard equilateral triangle. The geometry 
invariance result of Theorem 1 from Section 3 still holds under the alternatives, in the following sense. If, in an 
arbitrary triangle, a small percentage 6 ■ 100% where 6 G (0,4/9) of the area is carved away as forbidden from 
each vertex using line segments parallel to the opposite edge, then under the transformation to the standard 
equilateral triangle this will result in the alternative j—_ — This argument is for segregation with 5 < 1/4; a 

V 3(5/ 4 

similar construction is available for the other cases. 



3 Asymptotic Normality Under the Null and Alternative Hypothe- 
ses 

First wc present a "geometry invariance" result which allows us to assume T{y) is the standard equilateral 
triangle, T((0,0), (1,0), (l/2, V3/2)), thereby simplifying our subsequent analysis. 

Theorem 1: Let 3^ = {yi,y2,y3} C M? be three non-coUinear points. For z = 1, • • • , ti let Xi F = 
U [T{y)), the uniform distribution on the triangle T{y). Then for any r £ [1, oo] the distribution of p{Xn', h, Ny) 
is independent of 3^, hence the geometry of T{y). 

Proof: A composition of translation, rotation, reflections, and scaling will transform any given triangle 
To = r(yi,y2,y3) into the "basic" triangle Tb = T((0,0), (1,0), (ci,C2)) with < ci < 1/2, C2 > and (1 - 

ci)^ + C2 < 1, preserving uniformity. The transformation 0e : M'^ ^ M'^ given by c/)e(u, v) ~ (^u + ^^^^ 

takes Tfc to the equilateral triangle Tg = ^((0, 0), (1,0), (l/2, \/3/2)). Investigation of the Jacobian shows that 
(pe also preserves uniformity. Furthermore, the composition of 0e with the rigid motion transformations maps 
the boundary of the original triangle To to the boundary of the equilateral triangle Te, the median lines of To to 
the median lines of Te, and lines parallel to the edges of To to lines parallel to the edges of Te- Since the joint 
distribution of any collection of the hij involves only probability content of unions and intersections of regions 
bounded by precisely such lines, and the probability content of such regions is preserved since uniformity is 
preserved, the desired result follows. ■ 

Based on Theorem 1 and our uniform null hypothesis, we may assume that T{y) is the standard equilateral 
triangle with y = {{0, 0), (1, 0), (l/2, V3/2)} henceforth. 

For our r- factor proximity map and uniform null hypothesis, the asymptotic null distribution of Pn{r) = 
p{Xn]h,Ny) can be derived as a function of r. Let ^(r) := E[/9„(r)] and v{r) := Cov[/ii2, /iia]. Notice that 
p{r) = E[/ii2]/2 = P{X2 S Ny(Xi)) is the probability of an arc occurring between any pair of vertices. 



3.1 Asymptotic Normality under the Null Hypothesis 



By detailed geometric probability calculations, provided in Appendix 1, the mean and the asymptotic variance 
of the relative density of the r-factor proximity catch digraph can explicitly be computed. The central limit 
theorem for [/-statistics then establishes the asymptotic normality under the uniform null hypothesis. These 
results are summarized in the following theorem. 



Theorem 2: For r e [l,oo), 



(7) 



where 



37 „2 
216 

o Z 

2 ' 



for re [1,3/2) 
for rG [3/2,2) 
for r e [2, oo) 



and 



iy{r) = 1.1 (r) I(r £ [1,4/3)) + U2{r) I(r £ [4/3, 3/2)) + i^3{r) I(r e [3/2, 2)) + ly^ir) I(r G [2, (X)]) 



(8) 



(9) 



with 



3007 



13824 + 898 + 77760 - 1 17953 + 



- 24246 r'' + 60480 - 38880 + 



V2 



(r) 



58320 r4 

5467 r"^ - 37800 + 61912 r** + 46588 - 191520 + 13608 + 241920 - 155520 + 15552 



233280 r* 



'^3(^) = -[7r^^ - 72r" +312r^'' - 5332r-* + 15072r^ + 13704r'^ - 139264 + 273600 - 242176r^ 



+ 103232 - 27648 r + 8640]/ [960 7 



15 r* - llr^ - 48 r- + 25 



15 r6 

For r = oo, p„(r) is degenerate. 
See Appendix 1 for the proof. 




Figure 2: Asymptotic null mean /i(r) (left) and variance v{r) (right) from Theorem 2. The vertical lines 
indicate the endpoints of the intervals in the piecewisc definition of the functions. Notice that the vertical axes 
are differently scaled. 



Consider the forms of the mean and asymptotic variance functions, which are depicted in Figurc[2] Note that 
/x(r) is monotonically increasing in r, since Ny{x) increases with r for all x G RcAiiVj) \ ^s{Ny, Mc), where 
^sIn^,Mc) := {x £ T{y) : N^{x) T{y)} . In addition, ^i{r) ^ 1 as r oo (at rate O {r-^)), since the 



digraph becomes complete asymptotically, which explains why Pnir) becomes degenerate, i.e. v^r = oo) = 0. 
Note also that fi{r) is continuous, with the value at r = 1, = 37/216 .1713. 

Regarding the asymptotic variance, note that i/{7') is also continuous in r with lim^^oo i^ir) ~ and = 
34/58320 w .000583 and observe that sup^>i v{r) w .1305 at argsup^>i v{r) w 2.045. 

To illustrate the limiting distribution, r ~ 2 yields 
V^(p„(2)-m(2)) 



\/K2) 



or equivalently, 



Pn(2) 



approx . r / 5 25 



192ri 



The finite sample variance and skcwncss may be derived analytically in much the same way as was Cov[/ii2, ft.13] 
for the asymptotic variance. In particular, the variance of hi2 is 

w(r) = Var[/ii2] = wi,i(r) I(r G [1, 4/3))+ 

wi,2(r) I(r e [4/3, 3/2)) + coi,3(r) I(r G [3/2, 2)) + uj^^^ir) I(r G [2, 00)) 



where 



Wi,i(r-) 

'^l,2('") 

t^i,3(r) 
ti'i,4(r-) 



' ( 1369 + 4107 r'' + 902 r*^ - 78084 + 161784 - 182736 - 23328 + 155520 r - 55296) 

11664 (r + 2)(r + l)r2 ' 

1369 + 4107 r'' + 9650 - 98496 + 132624 r^* - 79056 - 57888 r + 72576 

11664 (r + 2)(r + l)r ' 
r^" +3r-^ - 62r** + 968r'' - 1704 - 1824 r* + 5424 - 1168 - 3856 r + 2208 



16 (r + 2)(r + l)r4 



3r'^ + 3r^ + 3r- 13 




Figure 3: Var[ft,i2] = Lu{r) as a function of r for r G [1, 5] 



InFigure[3]isthegraphofw(r) forr G [1,5]. Note that tj(r = 1) = 2627/11664 w .2252 and linv^oo =0 
(at rate O {r~'^)), argsup^gj;^ q^-j uj{r) w 1.66 with sup^gj;^ q^-) uj{r) w .6796. 

In fact, the exact distribution of Pnir) is, in principle, available by successively conditioning on the values of 
Xi. Alas, while the joint distribution of /ii2,/ii3 is available, the joint distribution of {/iij}i<i<j<n, and hence 
the calculation for the exact distribution of Pnir), is extraordinarily tedious and lengthy for even small values 
of n. 



Figure 13] indicates that, for r = 2, the normal approximation is accurate even for smaU n (although kurtosis 
may be indicated for n = 10). Figure [5] demonstrates, however, that severe skewness obtains for small values of 
n and extreme values of r. 




Figure 4: Depicted are the distributions of Pn(2) ^^K"^ j\f ylf;^) for 10, 20, 100 (left to right). Histograms 
are based on 1000 Monte Carlo replicates. Solid curves represent the approximating normal densities given in 
Theorem 2. Note that the vertical axes are differently scaled. 




Figure 5: Depicted are the histograms for 10,000 Monte Carlo replicates of pio{l) (left) and pio(5) (right) 
indicating severe small sample skewness for extreme values of r. 

Letting Hn{r) = X]r=i ^{-^u -^n+i) ^ the exact distribution of Pn(r) can be written as the recurrence 

(n + l)npn+iir) =n{n- l)p„(r) + i/„(r) 

by noting that the conditional random variable Hn{r)\Xn+i is the sum of n independent and identically dis- 
tributed random variables. Alas, this calculation is also tedious for large n. 



3.2 Asymptotic Normality Under the Alternatives 

Asymptotic normality of relative density of the proximity catch digraphs under the alternative hypotheses of 
segregation and association can be established by the same method as under the null hypothesis. Let Ef [•] 
( E^[-]) be the expectation with respect to the uniform distribution under the segregation ( association ) 
alternatives with e € (O, v^/3). 

Theorem 3 Let psir,e.) be the mean Ef[/ii2] and i/s(r, e) be the covariance, Covf [/112, /113] for r S [l,oo] 

and e G [O, V3/3) under i/f , y/n{pn{r) — fis{r, e)) — ^ Ar(0, lysir, e)) for the values of (r, e) for which iys{r, e) > 0. 
Likewise for Hf. 



Sketch of Proof: Under the alternatives, i.e. e > , /5„(r) is a J7-statistic with the same symmetric kernel 
hij as in the null case. Under , the mean /^^(r, e) = Ef [p„(r)] = Ef [/ii2]/2, now a function of both r and 
e, is again in [0,1]. The asymptotic variance 1^5 (r,e) = Covf [/ii2, /iia], also a function of both r and e, is 
bounded above by 1/4, as before. Thus asymptotic normality obtains provided 1^5(7', e) > 0; otherwise Pnir) is 
degenerate. Likewise for Hf. 

The explicit forms of pLs{r,e) and yLA(r^e) are given, defined piecewise, in Section ??. Sample values of 
^5(r, e), vsi^i e)j and ^JLA{r, e), VA{r, e) are given in Section [5 . 8 . 1 1 under segregation with e = -y/S/S, ■\/3/4, 2 -\/3/7 
and in Section .8 . 21 under association with e = 5 \/3/24, a/3/12, \/3/21. Note that under ijf , 

i/s(r, e) > for (r, e) € [l, V3/(2e)) x (o, ^3/4] U [l, VS/e - 2) x (V3/4, Vs/s) , 

and under H^^ 

VA{r,e) > for (r, e) e (l,oo) x (o, \/3/3) U {1} x (o, V3/12) . ■ 

Notice that under the association alternatives any r e (l,oo) yields asymptotic normality for all e e 
(0, a/3/3), while under the segregation alternatives only r = 1 yields this universal asymptotic normality. 



3.3 The Test and Analysis 



The relative density of the proximity catch digraph is a test statistic for the segregation/association alternative; 
rejecting for extreme values of pn{r) is appropriate since under segregation we expect pn{r) to be large, while 
under association we expect Pni^) to be small. Using the test statistic 

R=MPnir)_Kr))^ (10) 

the asymptotic critical value for the one-sided level a test against segregation is given by 

z„ = $-i(l-a) (11) 

where $(•) is the standard normal distribution hmction. Against segregation, the test rejects for R > Za and 
against association, the test rejects for R < zi_q,. 



3.4 Consistency 

Theorem The test against which rejects for R > and the test against which rejects for R < 
are consistent for r € [l,oo) and e S (O, a/3/3). 

Proof: Since the variance of the asymptotically normal test statistic, under both the null and the al- 
ternatives, converges to as n — > 00 (or is degenerate), it remains to show that the mean under the null, 
p{r) — E[p„(r)], is less than (greater than) the mean under the alternative, psi^i^) — Ef [pn(r)] against segre- 
gation (/x^(r, e) — E^[p„(r)] against association) for e > 0. Whence it will follow that power converges to 1 as 
n —>■ 00. 

Detailed analysis of p-sir, e) in Appendix 2 indicates that under segregation psi^y ^) > pii") for all e > and 
r E [1, 00). Likewise, detailed analysis of pAir, e) in Appendix 2 indicates that under association pAir, e) < p{r) 
for all e > and r G [l,oo). Hence the desired result follows for both alternatives. ■ 

Remark: In fact, the analysis of /xs(r, e) and pA{r,e) under the alternatives reveals more than what is 
required for consistency. Under segregation, the analysis indicates that ps{r,ei) < ps{r,t2) for ei < 62. 
Likewise, under association, the analysis indicates that /x^(r, ei) > pA{r^€2) for ei < £2. □ 



3.5 Monte Carlo Power Analysis Under Segregation 

In segregation alternatives with e > 0, we implement the above described Monte Carlo experiment for various 
values of r (for which pnir) is non-degenerate). Recall that Pnir) is degenerate for large r at each e > 0. In 
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Figure 6: Kernel density estimates for the null (solid) and the segregation alternative H^^^ (dashed) for 
r = 1, 11/10, 6/5, 4/3, V2, 3/2, 2, and 3 (left-to-right). 



particular, p„ (r, -^/S/S) is degenarate for r > 4, /9„ (r, V3/4) is degenarate for r > 2, and pn (r, 2\/3/7) is 
degenaratc for r > 3/2. 

Let (Ofe(n) be the empirical relative density for experiment k and {n) be the j*^ (ordered) empirical relative 
density for j ~ 1,. . . ,N. Then for each r value, we estimate the empirical critical value Cf := P(\(i-a) N^){n) 
and the empirical significance level af^^{n) :~ ;^ under Hq and the empirical power 

^Lin, e) := ^ I {p, > ) under ijf with e = V3/8, ^/3/4, 2 ^3/7. 

For segregation with e = -\/3/8 ~ .2165, we run the Monte Carlo experiments for eight r values: 1, 11/10, 6/5, 4/3, ^/2, 3/2, 
and 3. In Figure [6] are the kernel density estimates for the null case and the segregation alternative with 
e = V3/8, for the eight r values with n = 10 and = 10, 000. Observe that under both Hq and H^^^, kernel 

density estimates are skewed right for r = 1, 11/10, (with skewncss increasing as r gets smaller) and kernel 
density estimates are almost symmetric for r ~ 6/5, 4/3, ^/2, 3/2, 2, with most symmetry occurring at r — 3/2, 
kernel density estimate is skewed left for r = 3 (with skewness increasing as r gets larger). 

The empirical critical values, empirical significance levels, and empirical power estimates under H^^^ are 
presented in Table [H 

In Figure [71 we present a Monte Carlo investigation against the segregation alternative H^^^ for r = 11/10, 
and n = 10, = 10,000 (left), n = 100, N = 1000 (right). With n = 10, the null and alternative probability 



r 


1 


11/10 


6/5 


4/3 


V2 


3/2 


2 


3 




0.24 


.3 


.35 


.4 


.5 


.5 


.82 


0.98 




.0324 


.0403 


.0484 


.0442 


.0446 


.0492 


.049 


.0389 


/3L (10, V3/8) 


.0381 


.0787 


.122 


.1571 


.1719 


.1955 


.2791 


.2901 



Tabic 1: The empirical critical values, empirical significance levels, and empirical power estimates under 
N = 10, 000, and n = 10 at a = .05. 
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Figure 7: Two Monte Carlo experiments against the segregation alternative H^^^. Depicted are kernel density 
estimates for p„(ll/10) for n = 10 (left) and n — 100 (right) under the null (solid) and alternative (dashed). 



density functions for pio(ll/10) are very similar, implying small power (10,000 Monte Carlo replicates yield 
(10, 73/8) = 0.0787, = 0.15, and af„c(10) = 0.0484). With n = 100, there is more separation 

between null and alternative probability density functions; for this case, 1000 Monte Carlo replicates yield 
(100, ^3/8) = 0.77, Cl = 0.2203, and 2^^(100) = 0.05. Notice also that the probabihty density functions 

are more skewed for n = 10, while approximate normality holds for n ~ 100. 

For segregation with e = V3/4 ~ .433, we run the Monte Carlo experiments for six r values, 1, 11/10, 6/5, 
4/3, V2, 3/2. In Figure[8l are the kernel density estimates for the null case and the segregation alternative with 
e = V3/4, for the six r values with n = 10 and TV = 10, 000. Observe that under H^^^, kernel density estimate 

is skewed right for r ~ 1 and kernel density estimates are almost symmetric for r = 11/10, 6/5, 4/3, \/2, with 
most symmetry occurring at r = 4/3, kernel density estimate is skewed left for r = 3/2. 

The empirical critical values, empirical significance levels, and empirical power estimates under H^^^ are 



presented in Table [2l 



r 


1 


11/10 


6/5 


4/3 


72 


3/2 




.24 


.3 


.35 


.4 


.5 


.5 




.0318 


.0411 


.0479 


.0484 


.0481 


.043 


PL (10, V3/4) 


.1247 


.9138 


.998 


1.0 


1.0 


1.0 



Table 2: The empirical critical values, empirical significance levels, and empirical power estimates under H^^^, 
N = 10, 000, and n = 10 at a = .05. 



For segregation with e = 2 a/3/7 ~ .495, we run the Monte Carlo experiments for six r values, 1, 21/20, 
11/10, 6/5, 4/3, v^. In FigurelHl are the kernel density estimates for the null case and the segregation alternative 
with e = 2 V3/7, for the six r values with n = 10 and N = 10, 000. Observe that under H^^^^, kernel density 

estimate is skewed right for r = 1 and kernel density estimates are almost symmetric for r = 21/20, 11/10, 6/5, 
with most symmetry occurring at r = 6/5, kernel density estimate is skewed left for r = \/2- 

The empirical critical values, empirical significance levels, and empirical power estimates under H^^^^ are 



relative density relative density relative density 
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Figure 8: Kernel density estimates for the null (solid) and the segregation alternative H^^^ (dashed) for 
r = 1, 11/10, 6/5, 4/3, V2, 3/2 (left-to-right). 
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Figure 9: Kernel density estimates for the null (solid) and the segregation alternative H^^^^ (dashed) for 
r = 1, 21/20, 11/10, 6/5, 4/3, ^2 (Icft-to-right). 



r 


1 


21/20 


11/10 


6/5 


4/3 


V2 




.24 


.28 


.3 


.35 


.4 2 


.5 




.0318 


.0447 


.0411 


.0479 


.0477 


.0481 


/3^JlO, 2V3/7) 


.1247 


.9728 


1.0 


1.0 


1.0 


1.0 



Tabic 3: The empirical critical values, empirical significance levels, and empirical power estimates under 
, N = 10, 000, and n = 10 at a = .05. 




Figure 10: Monte Carlo power using the empirical critical value against segregation alternatives H^^^ (left), 
^V3/i ^^'^ ^2\/2,/7 (''^St't) as a function of r, for n = 10. 



presented in Table [3l 

We also plot the empirical power as a function of r in Figure [TOI Let r*g (e) be the value of r at which 
maximum Monte Carlo power estimate occurs, then rg(-\/3/8) = 3. Furthermore, Monte Carlo power estimate 
increases as r gets larger and then decreases, due to the magnitude of r and n. Because for small n and large 
r, the critical value is approximately 1 under Hq, as we get a complete digraph with high probability. 

Furthermore, r* (V3/4) e {4/3, V2, 3/2} and r* (2^3/7) e {ll/lO, 6/5, 4/3, \/2}. Monte Carlo power 
estimates increase as r gets larger. The phenomenon happened above for e = \/3/8 does not occur, because r 
values are not large enough to yield complete digraphs under Ho with high probability. 



asymptotic critical value — as a function of the proximity factor r. Let Rj := ^ ^ ^ ^ ^ be the standardized 



For a given alternative and sample size, we may consider analyzing the power of the test — using the 

\/n'(pj(»)-M(r-)) 

relative density for experiment j with sample size n for j = 1, 2, . . . , iV. For each r value, the level a asymptotic 
critical value is /x(r) + 2:(i_q,) • y/ v{r)/n. We estimate the empirical power as /3f (r, e) := SjLi I (^j > -Zi-a)- 
In Figure [TTJ we present a Monte Carlo investigation of power /3,f(r, e) against H^^^, H^^^, and H^^^^ 
as a function of r for n = 10. The empirical significance level is a^^^{n) :— -"■ i-^j -zi-ali^o)- Then 

5;s(10), is about .05 for r = 2, 3 which have the empirical power /^fp (r, -\/3/8) ~ .35, and /3fQ(r, e) = 1 
for e = •\/3/4, 2 •\/3/7. So, for small sample sizes, moderate values of r are more appropriate for normal 
approximation, as they yield the desired significance level and the more severe the segregation, the higher the 
power estimate at each r. 

The empirical significance level, and empirical power f3^{r, e) values under for e = a/3/8, \/3/4, 2 a/3/7 
are presented in Table U) Note that even for n = 10, the plots of the empirical power I3^ {r,e) resemble the 
curves of the asymptotic power function Us (r) in Section 13.91 



3.6 Monte Carlo Power Analysis Under Association 

In association alternatives with e > 0, we implement the Monte Carlo experiment for r G {l, 11/10, 6/5, 4/3, 
•\/2, 3/2, 2, 3, 5, lO}. Then for each r value, we estimate the empirical critical value := P{[aN]) ^-nd the 



Figure 11: Monte Carlo power using the asymptotic critical value against segregation alternatives H^^^ 
(left), ^^3/4 (middle), and H^^^^ (right) as a function of r, for n = 10. The circles represent the empirical 
significance levels while triangles represent the empirical power values. 



r 


1 


11/10 


6/5 


4/3 


V2 


3/2 


2 


3 


5 


as{n) 


.2829 


.2019 


.1486 


.1224 


.1139 


.0966 


.0619 


.0374 


.000 


(r,^/3/8) 


.3086 


.3309 


.3123 


.3233 


.3365 


.3317 


.3175 


.2950 


.0000 


(r,V3/4) 


.6519 


.9985 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


.0000 


/3~^(r,2V3/7) 


.6508 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


.0000 



Table 4: The empirical significance level and empirical power values under for e ~ •\/3/8, \/3/4, 2-\/3/7, 
N = 10, 000, and n = 10 at a = .05. 



empirical significance level a^^{n) ^ < ^nj under i^g and the empirical power (3^^^(n, e) := 

■i- X^jLi I i^Pj < ) • We implement the Monte Carlo simulation for three e values; 5 \/3/24, 
73/12, 73/21. 

The empirical critical values, empirical significance levels, and empirical power estimates under are 
presented in Table O 

For association with e = 5 \/3/24 « .36, in Figure fT2j are the kernel density estimates for the null case and 
the segregation alternative for the ten r values with n = 10, iV = 10, 000. Observe that, under iJo, kernel density 
estimates are skewed right for r = 1, 11/10, (with skewness increasing as r gets smaller) and kernel density 
estimates are almost symmetric for r ~ 6/5, 4/3, -\/2, 3/2, 2, with most symmetry occurring at r = 3/2, kernel 
density estimates are skewed left for r = 3, 5, 10, (with skewness increasing as r gets larger). Under H^^^^^, 

kernel density estimates are skewed right for r = 1, 11/10, 6/5, 4/3, 3/2, 2, 3, (with skewness increasing as r 
gets smaller) and kernel density estimate is almost symmetric for r = 5, kernel density estimate is skewed left 
for r = 10. 

For association with e = \/3/12 w .144, in Figure [T51 are the kernel density estimates for the null case and 
the segregation alternative for the ten r values with n = 10, iV = 10,000. Observe that under H^^^^, kernel 

density estimates are skewed right for r = 1, 11/10, 6/5, 4/3, (with skewness increasing as r gets smaller) and 
kernel density estimates are almost symmetric for r = -^2, 3/2, 2, with most symmetry occurring at r = 2, 
kernel density estimates are skewed left for r = 3, 5, 10, (with skewness increasing as r gets larger). 

Note also that for r = 11/10 with n = 10, = 1000, the kernel density estimates are very similar, implying 
small power. With TV = 10,000 P^c{W, V3/12) = 0.0921, = 0.15, and 2^^(10) = 0.0484. See Figure 
1141 Note that for large n, there is more separation between null and alternative kernel densities, which implies 
higher power. With n ^ 100, N 1000 and get = 0.1963, 2^^(100) = 0.049, and P:^^ (lOO, V3/12) = 0.56. 

For association with e = \/3/21 w .0825, in Figure [TSl are the kernel density estimates for the null case and 
the segregation alternative for the ten r values with n = 10, iV = 10,000. Observe that under H^^^^, kernel 

density estimates are skewed right for r = 1, 11/10, 6/5, 4/3, (with skewness increasing as r gets smaller) and 
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;ure 13: Kernel density estimates for the null (solid) and the association alternative H^^_^^ (dashed) for 
= 1, 11/10, 6/5, 4/3, V2, 3/2, 2, 3, 5, 10 (left-to-right). 
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Figure 14: Two Monte Carlo experiments against the association alternative H^^^^. Depicted are kernel 

density estimates for p„ (11/10) for n = 10 (left) and n = 100 (right) under the null (solid) and alternative 
(dashed). 



r 


1 


11/10 


6/5 


4/3 


V2 


3/2 


2 


3 


5 


10 


riA 


.13 


.14 


.16 


.2 


.2 


.24 


.4 2 


.65 


.82 


.91 







.0112 


.0208 


.0308 


.0363 


.0359 


.0392 


.0413 


.0478 


.0398 


/3';^,(10, 5V3/24) 





.0213 


.0754 


.2052 


.3253 


.4365 


.946 


.9993 


.9473 


.4242 


/?;^,(10, ^/3/12) 





.0921 


.0645 


.1448 


.2002 


.2274 


.2739 


.1383 


.0823 


.0639 


/?^,(10, V3/21) 





.0151 


.0364 


.0605 


.0746 


.0771 


.0764 


.0618 


.0501 


.0518 



Table 5: The empirical critical values, empirical significance levels, and empirical power estimates under 
for e = 5 \/3/24, V^/12, \/3/21 and n = 10 at a = .05. 

kernel density estimates are almost symmetric for r = v^, 3/2, 2, with most symmetry occurring at r = 3/2, 
kernel density estimates are skewed left for r = 3, 5, 10, (with skewness increasing as r gets larger). 

We also plot the empirical power as a function of r in Figure 1161 Let r\ (e) be the value at which maximum 
Monte Carlo power estimate occurs. Then r\ (5^3/24) = 3, r\ (%/3/12) = 2, and for r\ (^3/21) = 3/2. 
Notice that the more severe the association the larger the value of r\. Based on the analysis of the Monte Carlo 
power estimates, we suggest moderate r values for moderate association. 

We also estimate the power using the asymptotic critical value in association alternatives for various values 
of r. For each r value, the level a asymptotic critical value is /i(r) + ■ \/v(r)ln. We estimate the empirical 

power as I5^{r, e) ^ Y.f=i U^j < ^a)- 

In Figure [T71 we present a Monte Carlo investigation of power against H^^^^, H^^^^, and H^^^^^ as a 
function of r for n = 10. The empirical significance level is aA{n) :— X^^i -'■(^j < Za\Ho). Then aA{^^); is 
about .05 for r = \/2, 3/2, 2, 3, 5 which have the empirical power (r, -\/3/12) < .35 with maximum power 

at r = 2, and (r = 3, 5 •\/3/24) = 1. So, for small sample sizes, moderate values of r are more appropriate 
for normal approximation, as they yield the desired significance level, and the more severe the association, the 
higher the power estimate. 

The empirical significance levels and empirical power /3f (r, e) values under for e = 5 \/3/24, -^3/12, a/3/21 
are presented in Table [51 

Note that even for n = 10, the plots of the empirical power /3j^(r) resembles the curves of the asymptotic 
power function 11^ (r) in Section [531 
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Figure 15: Kernel density estimates for the null (solid) and the association alternative H^^^^ (dashed) for 
r^l, 11/10, 6/5, 4/3, ^2, 3/2, 2, 3, 5, 10 (Icft-to-right). 



r 


1 


11/10 


6/5 


4/3 


V2 


3/2 


2 


3 


5 


10 




.5318 


.2426 


.1869 


.1031 


.0673 


.0559 


.0656 


.0627 


.0771 


.0955 




.6273 


.3663 


.3923 


.4103 


.4167 


.5316 


.9610 


.9983 


.9656 


.5443 


(3fo{r, VS/12) 


.6300 


.3537 


.3583 


.3190 


.2698 


.2919 


.3433 


.1825 


.1429 


.1261 




.6012 


.2979 


.2574 


.1629 


.1190 


.1077 


.1098 


.0889 


.0989 


.1033 



Table 6: The empirical significance level and empirical power values under for e ~ 5 V3/24, V3/12, \/3/21 
with N = 10, 000, and n = 10 at a = .05. 



Figure 16: Monte Carlo power using the empirical critical value against association alternatives -^^^3^21 (^*^^^)' 
H^^, (middle) and ^ , , (right) as a function of r, for n = 10. 

v3/12 ^ ^ 5 v3/24 ^ ^ ' 




Figure 17: Monte Carlo power using the asymptotic critical value against association alternatives ^^3^21 
(left), H^^^^ (middle), and H^^^^^ (right) as a function of r, for n = 10. The circles represent the empirical 
significance levels while triangles represent the empirical power values. 



3.7 Pitman Asymptotic Efficacy 



Suppose that the distribution F under consideration may be indexed by a set O C M and consider Hq : 9 = 0q 
versus Ha ■ 9 > 9q. 

Pitman asymptotic efScacy (PAE) provides for an investigation of "local asymptotic power" — local around 
Hq. This involves the limit as n — > oo as well as the limit as e — > 0. 

Consider the comparison of test sequences S = { -Sn } satisfying the following conditions in a neighborhood 
6* e [0q^9q + 5] of the null parameter for some i5 > 0. 

Pitman's Conditions: 



(PCI) For some functions /i„(0) and (Tn{9), the distribution Fg of ^Sn — Hn 
uniformly on [^o, + d], i.e., 



sup sup 

9o<d<9o+5 teR 



P 



Sn — I^ni9) 
<Jn{0) 



< t 



I /an{9) converges to Z ^ 7V(0, 1) 



as n — !■ oo. 



(PC2) For 9 e [9q, 6q + 6], fin{9) is differentiable with fi'n{9o) > 0, 
(PC3) For 9n = 9Q + {n-^/^), lim„^oo = 1, 

(PC4) For 9n = 9o + (n-i/2), lim„^oo = 1. 

(PC5) For some constant c > 0, 

lim-^ 



Condition (PCI) is equivalent to 

(PCI)' For some functions /x„ (6> ) and (Tn (9), t he distribution Fg of [Sn — Mn(^'n)] /c'ri(^ri) converges to a standard 
normal distribution fsee lEedenI ( 1963 )). 

Note that i^^iit\9o) > and /iL'^(6'o) = 0, for alH = 1, 2, . . . , fc-1, then ii'^{9o) in ( PC2), (PCS), and (PCS) c an 
be replaced by n'h \9o) > and ^ini9n) in (PC3) can be replaced by nil'\9n) fsee lKendall and StuartI (|l979| )). 

Lemma 1: (Pitman-Noether) 

(i) Let S = {Sn} satisfy (PC1)-(PC5). Consider testing Hq by the critical regions Sn > with a„ ~ 
Poo {Sn > Ua„) a as n ^ oo where a S (0,1). For (3 <E (0, 1 — a) and 0„ = 9q + O (ri"^/^) , we have 

Pn{On) = Pe„{Tn > ^ /? iff c -9)^ <p-\l -a)- 

(ii) Let S = {Sn} and Q = {Qn} each satisfy satisfy (PC1)-(PC5). Then the asymptotic relative efficiency 
of S relative to Q is given by ARE{S, Q) ~ {cs/cq)^ . 

Thus, to evaluate ARE{S,Q) under the conditions (PC1)-(PC5), we need only calculate the quantities eg 
and CQ, where 



cs = lim 



and cq ~ lim 



PAE{S) = c| is called the Pitman Asymptotic Efficacy (PAE) of the test based on Sn- Using similar notation 
and terminology for Qn, 

^^^(^'^)-pAE(Qy 

For segregation or association alternatives the PAE of Pnir) is given by PAE(r) = -^^ — — where k is 

the minimum order of the derivative with respect to e for which ^'^^^ (r, e = 0) 7^ 0. That is, /i*^*'' (r, e = 0) 7^ 
but n^^\r, e = 0) = for Z = 1, 2, . . . , A: - 1. 



3.7.1 Pitman Asymptotic Efficacy Under Segregation Alternatives 



Consider the test sequences p{r) = {pn{f)} for sufficiently small e > and r e [l, \/3/{2e)). 

In the PAE framework above, 9 = e and 9o — 0. Suppose, /^„(e) = £'f [p„(r)] — [is{t, e). For e e [O, V3/8), 

5 

with the corresponding intervals X\ = 1,3/2 — \/3e], X2 = 3/2 — \/3e,3/2], X3 = 3/2,2 — 4 e/Vs), 



14= 2 — 4 e/V3, 2j, X5 = 2, V3/(2e)j. Sec Appendix 2 for the explicit form of n{r,e) and Appendix 3 for 

derivation. Notice that as e 0, only Ii = 1, 3/2 - x/3 , X3 = 3/2, 2-4 e/Vtj , X5 = 2, V3/(2 e)) do not 
vanish, so we only keep the components of psi^, e) on these intervals. 

Furthermore, cr|(n, e) = Varf (p„(r)) = Varf [feia] + i^s('', e) = Covf [/ii2, ft.13]. The explicit 

forms of Varf [/112] and Covf [/ii2, ft-ia] are not calculated, since we only need lim„_foo o'ni^ = 0) = iy{r) which 
is given in Equation [Sj 

No tice that I fei2p < 8 < 00 and Ef [/ii2 /iia]— Ef [/ii2]^ = Covf [/ii2, /iia] > then bv lCallaert and Janssen 
(Il978l) 

Pn{r) - Hs{r, e)) 



sup tgi 



<t\- m 



<CEf I/112I' \vs{r.e)\-^ n-5 



where C is an absolute constant and $(•) is the standard normal distribution function. Then (PCI) follows for 
eachrG 1, \/3/(2 e)^ and e e 0,^3/4^ 

Differentiating ps (r, e) with respect to e yields 

M's(r, e) = t^'i,i(r, e) I (r G [l, 3/2 - V3 e)) + w'^^^(r, e) I G [3/2, 2 - 4 e/Vs)^ 

+ t^'i,5(r,6)l(re [2,^/3/(2e)) 

where 

, 2e(144e2(r2-l) + 36- 37r2) 

'^1^1^'^''^ ~ 27(2e-l)3(2e+l)3 

tu'i 3(r, e) = 2 \/3((2r - 3) 64 + (7r2 + r"^ - 24 r + 20) 16 ^Fi(? + (r - 3) 48 e + 3 \/3 + 96 \/3 ? 
-36V3-60%/3r2)e / 9 (2 e + 1)=^(2 e - 1)^^ 

8 V3e (48 e3 + (Sr^ + 3r2 - 20) 4 ^3 + 36 e + 9 V3 - 9 V3 r^) 



27r2(2e+l)3(2e- 1)3 



Hence, /Zg(?-, e = 0) = , so we need higher order derivatives for (PC2). A detailed discussion is available in 
Kendall and StuartI (|l979t ). 



Differentiating /ig(r, e) with respect to e yields 

MS(r,e)=n7;;i(r,e)l(re [l, 3/2 - \/3 e)) + n7i'_3(r, e) I (r e [3/2, 2 - 4 e/Vs)) 

+ <5(r,e)l(re [2,V3/(2e)) 



where 

w'/ i(r, e) 



2 (r2 - 1) 1728 e'' + (72 - 77 r^) 4 + 36 - 37 
27 (4 £2 _ 1)4 

(2 r - 3) 512 V3 + (20 + + 7 - 24 r) 576 e"^ + (2 r - 3) 1024 ^3 + (20 - 108 r 
mr + ^r^) 36 + (-3 + 2 r) 96 VS e - 108 + 9 - 180 + 288 r] / [9 r2(2 e + 1)^(2 e - 1) 



w'l^{r, e) = -8 [l28 \/3 + (-20 + 3 r^^ + 3 r^) 48 e'^ + 256 VSe^ + (-5 - 12 + 3 r^) 12 



24 e V3 + 9 - 9 



/ 9r2(2e+l)4(2e-l)^ 



Thus, 



-2 



8 + 74 ^2 
3 ^ 27 ' 

(,.2_4 f.+2)(r^+4 r- 



i(l-r-") 
— 7i 



for re [1,3/2) 
for re [3/2,2) 
for r e [2, V3/(2e)). 



(12) 



Observe that /x'^(r, e 0) > for all r e 1, V3/(2 e) j , so (PC2) holds with the second derivative. (PC3) in the 
second derivative form follows from continuity of ^JL'g{r, e) in e and (PC4) follows from continuity of cr^ (r, e) in e. 



Next, we find cs{p{r)) = lim„_ 



Ms(r,(r=0) 



where numerator is given in Equation [12] and 



denominator is given in Equation [51 We can easily see that cs(p(r)) > 0, since cs{p{r)) is increasing in r and 
cs{p{r = 1)) > 0. Then (PC5) follows. So under segregation alternatives Hf , the PAE of Pn{f) is given by 



PAE^(r) = 4(p(r)) 



v{r) 



« 600 

H 

<: 

Oh 




Figure 18: Pitman asymptotic efficacy against segregation (left) and against association (right) as a function 
of r. 

In Figurc[T8l(left). we present the PAE as a function of r for segregation. Notice that PAE'^(r = 1) = 160/7 w 
22.8571, limr^oo PAE'^(r) = 00. Based on the PAE analysis, we suggest, for large n and small e, choosing r 
large for testing against segregation. However, for small and moderate values of n, normal approximation is not 
appropriate due to the skewness in the density of Pn(r), Therefore, for small n, we suggest moderate r values. 

PAE analysis is local (around e = 0) and for arbitrarily large n. The comparison would hold in general 
provided that p[r, e) is convex in e for all e e [O, V3/3). As an alternative, we fix an e and then compare the 
asymptotic behaviour of Pnir) with Hodges-Lehmann asymptotic efficacy in Section [3.8.1l 



3.7.2 Pitman Asymptotic Efficacy Under Association Alternatives 



Consider the test sequences p{r) = {pn{f)} for sufficiently small e > and r E [l,oo). 

In the PAE framework above, 9 — e and — 0. Suppose, /^^(e) = E£[p„(r)] = ^^(r, e). For e £ 
0, (7 V3 - 3 VTS) /12 « .042^ , 

6 

i=i 

with the corresponding intervals li = 1, (l + 2 V3e) / (l — \/3 e)^ , ^2 = (l + 2 \/3e) / (l — \/3 e) , 

4 (1- V3e)/3), X3 = [4 (l-x/3e)/3,4 (l + 2V3e)/3), X4 = [4 (l + 2 x/3 e) /3, 3/(2 (l- VSe))), X5 = 

3/ (2 (1 — a/3 e)), 2^ and Xg = [2, 00). Notice that as e — > 0, only X, for j = 2, 4, 5, 6 do not vanish, so we only 

keep the components of HAir, e) on these intervals. See Section ?? for the explicit form of pAir, e) and Section 
?? for derivation. 

Furthermore, aKe) = Varf{p„ {r)) = Varf [^12] + Covf [/ii2, /113] whose explicit form is not 

calculated, since we only need lim„^oo \/n(Tn{e = 0) = i^(7') which is given Equation [9l 

(PCI) follows for each r G [1, 00) and e E [O, \/3/3) as in the segregation case. 

Differentiating pAif, e) with respect to e, then we get 

e) = ^[,2ir, e) I(r- G [1, 4/3)) + n7;,4(r, e) I(r € [4/3, 3/2)) 

+ w'ir, e) I(r G [3/2, 2)) + w'{r, e) I(r G [2, 00)) 



where 



^'1.2 (^e) = -2 



\/3( -1152 rV + 720 Vir^e^ - 288 r'' e + 11 \/3 r'' + 2592 \/3 r^e^ - 10368 \/3 re^ 



+ 432\/3r2 + 6480V3e2- 864 V3r + 432\/3je / (-6e+\/3j fGe + VSj 



: -2 ^\/3(^-1152 rV + 720 VS r'^e^ - 288 r'* e + 11 \/3 r"^ - 1296 VS r^e^ + 108 VS 

2160 %/3e2 - 144 Vs^e / (^-6e + %/3)^ (6e + \/3)% 
2 e (3 - 72 r2 - 240 + 192 r - 124) 



r2(12e2 _ 1)3 



40 e 



r2(12e2 - 1)2- 

Hence /^^(r, e = 0) = 0, so we differentiate /x^(r, e) with respect to e and get 

M:;(r, e) = w'l^{r, e) I(r G [1, 4/3)) + w'l^{r, e) I(r G [4/3, 3/2)) 

+ w'Ur, e) I(r G [3/2, 2)) + m'' (r, e) I(r G [2, ex.)) 



where 



\/3( -27648 r^e^ + 25920 V^r^e^ - 18432 r'^e^ + 2820 \/3 r^e^ + 93312 VSr^e^ 



^1,2 (''jf) = -6 

- 576 e - 373248 V3 re" + 1 1 \/3 r'' + 33696 VSr^e^ + 233280 VSe"^ ~ 82944 ^/3^e2 
+ 432 V3 + 45360 VSe^- 864 \/3r + 432 Vs) / (g e + (^-6 e + V^j^ , 

wli{r,e) = -6 V3(-27648rV + 25920 %/3r*e^- 18432 r^'e^ + 2820 VSr^e^ - 46656 VSr^e'^ 

- 576 e + 11 V3 + 2592 Vs r^e^ _ 7776O VSe^ + 108 \/3 - 15120 VSe^ - 144 Vs 



/[(6e + Vs)^ 



2 (180 r^e^ + 3 r'' - 4320 r^e^ _ 8640 + 11520 re^ - 72 - 8160 + 192 r - 124) 



Thus, 



40 (36 £2 + 1) 
r2(12e2 - 1)3 



/.:;(r,e = 0) = 



r2(12e2- 1)4 



r2 + 192r~i - 96r-2 _ 96 for re [1,4/3) 
r2 + 32r-2_24 for rG [4/3,3/2) 

-6r2- 384 r-i+248r-2 + i44 for re [3/2,2) 



-40 



for r e [2, 00) 



(13) 



Note that a*a(?', e = 0) > for all r G [l,oo), so (PC2) follows with the second derivative. (PC3) and (PC4) 
follow from continuity of /^^(r, e) and a'^{r,e) in e. 



Next, we find C/i(/9(r)) = lim„_ 



fj.'[{r,e=0) 



by substituting the numerator from Equation [13] 



and denominator from Equation [^1 We can easily see that CA{pir)) < 0, for all r > 1. Then (PC5) holds, so 
under association alternatives Hf, the PAE of Pn{r) is 



PAE^(r)=.ci(p(r)) 



In Figure fT8l (right), we present the PAE as a function of r for association. Notice that PAE'*(r = 1) = 
174240/17 « 10249.4118, lim^^oo PAE^(r) = 0, argsup^g[i^^) PAE'^(r) « 1.006 with supremum « 10399.7726. 

PAE'^(r) has also a local supremum at r; « 1.4356 with local supremum « 3630.8932. Based on the Pitman 
asymptotic efficacy analysis, we suggest, for large n and small e, choosing r small for testing against association. 
However, for small and moderate values of n normal approximation is not appropriate due to the skewness in 
the density of Pnif)- Therefore, for small n, we suggest moderate r values. 

We also calculate Hodges-Lehmann asymptotic efficacy for fixed alternatives in Section [3.8.21 



3.8 Hodges-Lehmann Asymptotic Efficacy 

Unlike PAE. HLAE does not involve the limit as e — + 0. Since this requires the mean and, especially, the 
asymptotic variance of Pnir) under an alternative, we investigate HLAE for specific values of e. See Appendix 
4 for a sample derivation of /^5(r, e) and vsii^j^)- 



3.8.1 Hodges-Lehmann Asymptotic Efficacy Under Segregation Alternatives 

In the HLAE framework. 6 ~ Ef [p„(r)] = /^s(r, e) and Oq = p{r). Then testing Hq : e = versus : e > is 
equivalent to Hq : E\pJ = u(r) versus Hf : Ef[p„l = pslr, e) > n(r). Let S = f^s(r.e)~,^{r) ^ ^ PM-Kr) ^ 

then Af{6,l). ~~ ■ 



Then HLAE of Pn{r) is given by 



HLAE^(r, e) 



We calculate HLAE of p„(r) under ijf for e = V3/8, e = \/3/4, and e = 2 V3/7. 
With e = V3/8, pn{r) is non-degenerate for r G [1,4) 



and 



where 



f 2287 2 
9126 



13 



for re [1,9/8) 

r 5905 +62910 -46656 r+13122 fo^, r £[9/8,3/2) 

61 r*-768r^+3494 r^-5120 r+2466 
338 

3r''-422r''+606 



338 

3 -48 r'' +530 r^- 768 
338 



for re [3/2,2) 
for r e [2, 3) 
for r e [2, 4] 



12 

i^s (r, \/3/8) = J2 ('^' ^Z^) 



(r, VS/Sj = [9959911 - 46006272 - 430526 

- 83976048 + 201277440 - 
(^r,\/3/8) = [9959911 r"' 

- IE 

(r, %/3/8) = [; 



8 



r- + 258785280 r ' - 385799609 +- 162699264 
129392640 



12939264j / [l04104845 r^j 
r'" - 46006272 - 430526 + 258785280 - 415110891 r*^ + 272331072 i 
- 158725008 - 16174080 + 315394560 - 310542336 r + 90574848] / [l04104845 r 
-- [3144167 r^^ + 15335424 r" - 378655166 r^" + 2750459904 - 11800111467 r** 

- 31878202752 r'^ - 54792387144 + 60339341664 - 42745183272 r" +- 19903426272 

- 6790168926 + 1989715104 r - 373071582] / [l04104845 r^] , 



Vi ( r. 



fe ( r. 



V7 



vio [r 



1^12 { T. 



\/3/8j = - [8177689 r^^ - 54153216 r" + 320428478 - 2459326464 + 11854698987 



- 32751603072 r'^ + 55010737224 - 59029241184 
+ 7379714142 - 



2 r ' + 55010737224 - 59029241184 r'' + 42131073672 - 20886001632 
- 1694942496 r + 170415414j / ^104104845 r**] , 
■, \/3/8j = - 1^8177689 r^^ - 54153216 + 320428478 r-^° - 2459326464 + 12509010411 
- 37904305536 r"^ + 71918042184 - 88617024864 



■, \/3/8) = - 



10724592861 - 1694942496 r + 170415414] / [l04104845 r"] . 
2718937 r - 39596544 



71256548232 - 36176875776 



+ 434455742 r"' - 315481. , , , 
39680803584 + 72881433288 - 88893062496 + 71547681672 
36487418112 + 10828106973 - 1694942496 r + 170415414] / [l04104845 r^] , 

- [l027 r^^ - 19968 r" + 295626 - 3265792 + 23210081 
103077696 r'^ + 289042360 - 511170304 r'"' + 553668600 r** - 343186304 
+ 109133095 - 20431008 r + 5845554] / ^428415 r*^] , 



■, \/3/8) = - [637 r^^ - 19968 + 299370 - 3265792 + 23199551 1 



i/g (r, v^/8) = - [( 



- 103077696 r'^ + 289042360 - 511170304 + 553700190 r'' - 343186304 : 
+ 109133095 - 20431008r + r ^ 



5788692] / [428415 r^] , 

637 r^^ - 19968 + 299370 - 3265792 + 24051519 
- 112023360 r'^ + 328179640 r" - 602490624 + 673558110 - 
+ 133604087 - 20431008 r + 5788692j / [428415 r^] , 

73/8^ = [l30 r^^ - 2496 r" + 22134 - 122720 r** + 452225 r** - 

+ 26624 r-^ - 1993566 r"* + 5324800 - 5083895 + 303264 r - 37908] / [428415 r®] , 



- 427086848 
' 1010880 r'^ + 1075400 r*^ 



_i (^r, V3/8) = -[330r* - 
- 1181 

^/3/8)=-(^ 



8896 + 85445 - 342624 + 332000 + 1148560 ) 

- 1180986 r-^ - 5324800 r + 6678947] / [428415 r"] , 

4936 r* + 12453 + 47388 - 12992 r - 128256) (r - 
428415 



-A 

\ ■ / 428415 r* 

and the corresponding intervals arc Xi = [1,12/11), I2 = [12/11,9/8), X3 — ^ - ; i -1 - 

[21/16,4/3), Xe = [4/3,3/2), I7 = [3/2, Vtj, Is = [%/3,7/4), Xg - [7/4,2), Xio = [2,3), In = 

[7/2,4). See SeCtinn f™- Hprivntinn and trnrp [Wil fnr thp oranh nf // ^'t- ^^'"^ ' ^ ^ 



V6/2, 21/16), X5 
[3,7/2), Xi2 = 



L-/ "I "/ -y: -I I 3/2, \/3 J, -LH ~ o, < /-iy , — L' -^lU — L^: -^H " I 

;ion ?? for derivation and Figure for the graph of /i (r, V3/8) and (r, V3/8) 
Then we get HLAE^ (r, V3/8) = ^^"^ ts"(!,V^/8)''''^ '^^ substituting the relevant terms. See Figure [H 



With e = V3/4, r e [1,2) 

,1s (r-,V3/4) 



-ir2 + fr-3 for re [1,3/2) 



54 '9 
7 'r^--48r^ + 122r^-128r+48 
27^ 



and 



for re [3/2,2) 

5 

1/5 (r, V3/4) = ^ (r, ^/3/4) I(X, ) 



where 



233266 r'^ + 1106688 - 2021199 + 1876608 



3431808 



3 rj 
- 233266 1 



i/i {r, \/3/4) = - [l4285 r'^ - 28224 r*^ - 

- 880794 r + 165888] / [3645 
V2 {r, \/3/4j = - [l4285 r^" - 28224 - 233266 + 1106688 r'' - 1234767 - 
+ 14049126 - 22228992 + 18895680 - 8503056 r + 
V3/4) =-[14285 7-^" 

13456044/ + 20636208 - 

15844032 
r- - 81881280 r-^ 
r-^ + 1331492 r'^ 



- 1594323]/ [3645 r*], 

{^■, \/3/4^ = - [l4285 r^" - 28224 - 233266 r** + 1106688 r - 2545713 + 5903280 

- 13456044 + 20636208 - 18305190 + 8503056 r - 1594323] / [3645 r*] , 
I (r,V3/4^ = [104920 - 111072 r'' + 1992132 r^- 15844032 r-^ + 50174640 r'' + 6377292 

- 34012224 r + 73220760 - 81881280 + 1909 - 27072 r^] / [l4580 r"] , 

U5 (j; Vs/'i) = - [-1187904 + 1331492/ + 433304 + 611163 r"' - 850240 - 198144 r 
+ 955392/ - 705536/ - 387680 r" + 1118472 - 1308960 r'^ + 175984 r^^ 

- 46176 + 5120 r" + 56016] / [20 /] , 

and the corresponding intervals arc Xi = [1,9/8), I2 = [9/8,9/7), I3 = [9/7,4/3), I4 = [4/3,3/2), 
[3/2, 2). See Figure [20] for the graph of ns {r, V3/4) and vs {r, ^3/4). 

Then we get HLAE (r, ^3/4) = ''^'^ ts'(!,7l/4)'''^^ substituting the relevant terms. See Figured 
With e = 2 V3/7, r e [1,3/2) 

/ \ f - 241 2 I 38 „ _ o 

^^s{r,2V3/7)-) ^ ' + " ^ 



I5 = 



-o for re [1,9/7) 

80 r^+866 -756 r+243 



and 
where 



^1 ( r, 2 V3/7 
1^2 (r, 2 73/7 



1^3 



1/4 



r, 2 V3/7 



-,2^3/7 



for re [9/7, 3/2) 

6 

(r,2V3/7) =^l^, (r,2^/3/7) I(X,) 

230713503/ 



= - 2495087 / - 5067342 / - 29145379 / + 134149248 / - 
+ 202262778 / - 90317349 r + 16336404] / [l4580 1 ' 

- 2495087 ~ 5067342 / - 

- 378587142 / + 1465530651 / - 2206303596 / + 1786050000 / - 765450000 1 
+ 136687500] 

= - 2495087 - 5067342 / - 29145379 r 
+ 731864538 / - 1559738349 / + 2174176 

- 136687500]/[l4580/ 



- 90317349 r- 

134149248/- 140359071/ 
I + 1465530651 / - 2206303596 / ^ 
/[l4580 / 

134149248 - 309668679 7 
1559738349 / + 2174176404 / - 1767825000 / + 765450000 ) 



1000147/ -654768 / 
- 1767825000/- 



- 321426/]/ 



14580/ 

I n AT// 24337 e 17857 5 

1^5 r, 2 V3/7 = 

H ' 'J 14580 810 



77561559 / - 527363136 / + 1468526760 
765450000 r - 2157840000 / + 136687500 + 24337 



r 
,10 



1000147 4 18188 . 174113 n 8176 

r"* r-^ H ; 

14580 405 1620 45 



(8r 


6 - 106/ + 


8709 - 39684 


+ 68000 - 


51192 r 


+ 










20/ 









Figure 19: Hodges-Lehmann asymptotic efficacy against segregation alternative as a function of r for 
e = x/3/8, V3/4, 2 ^3/7 (left to right). 



The corresponding intervals are Jj = [1,15/14), I2 = [15/14,15/13), I3 = [15/13,7/6), I4 = [7/6,5/4), J5 = 
[5/4, 9/7), Je = [9/7, 3/2). See Figure HOI for the graph of us (r, 2 ^/3/7) and z^s (r, 2 ^/3/7). 

Then we get HLAE^ (r, 2 \/3/7) = (A's(r,2 ^^/^^^^^^(^-^^ ^/^)) by substituting the relevant terms. In Figure 

[Hare the graphs of HLAE^(r, e) for e ^ VS/S, %/3/4, 2 V3/7. 

From Figure [T9l we see that, under ijf , HLAE'^(r, e) appears to be an increasing function, dependent on 
e, of r. Let ^^(e) be the minimum r such that Pn{r) becomes degenerate under the alternative . Then 
rs (^3/8) = 4, rs (%/3/4) = 2, and rs (2^3/7) = 3/2. In fact, for e £ (O, ^/3/4] , r5(e) = v^/(2e) and for 
e G (\/3/4, \/3/3) , rs{e) = \/3/e— 2. Notice that limr^ra(£) HLAE'^(p„(r), e) = oo, which is in agreement 
with PAE analysis because as e — > HLAE becomes PAE, and as e ^ 0, rs{e) 00 and under Hq, Pn{r) is 
degenerate for r = 00. The above result for HLAE can also be generalized for arbitrary e as follows. 

Proposition 1 Let r := argsup^gj^ rs{e}] HLAE'^(r, e) where ^^(e) is the value of r at which pn{r) becomes 
degenerate under ijf . Then r — rs{e). In particular, for e e [O, V3/4] , — -/S/ (2 e) and for e e (•\/3/4, •\/3/3] , 
rs = V3/e - 2. 

Proof: Recall that HLAE^(r,e) = ^^""^i^^^'I'f/''^'* ■ For e € [0,V3/4], fis{r,e) ^ 1 and j/(r, e) 
as r — (e) = \/3/(2e). Hence HLAE'^(r,e) — ^- cx) as r — > (e) = V3/(2e). So for e e [O, V3/4] , the 
F = V3/(2 e). For e e f V3/4, V3/3 



the result follows similarly. ■ 

So HLAE suggests choosing r larger as the segregation gets more severe, but choosing r too large will reduce 
power since r > rs{e) guarantees the complete digraph under the alternative and, as r increases therefrom, 
provides an ever greater probability of seeing the complete digraph under the null. 

In Figure [201 we plot the graphs of mean and asymptotic variance for r G [1,4] under segregation with 
e = 0, -y/S/S, "\/3/4, 2 ■\/3/7. Notice that e) gets larger as e gets larger at each r which is in agreement with 
the ^5(r, e) expressions in Section ??. However, the same ordering holds for iys{r,e) at each r only for large 
r, but for small r the ordering is reversed. Furthermore, both the sup,.g[]^_o2] iys{r,e) and argsup,,gj]^ i^5(r, e) 
seem to decrease as e increases. 



3.8.2 Hodges-Lehmann Asymptotic Efficacy Under Association Alternatives 

In the HLAE framework, 9 = E^[p„(r)] = yUyi(r, e) and Oq = iJ,A{r,e = 0). Then testing Hq : e = versus 
Ha : e > is equivalent to Hq : E[p„] = /i(r) versus Ha : E^[p„] = fiAir^e) < Let 5 = ^''^''^(7 e^''"' ^'^'^ 



1 1.5 2 ,,. 2.5 3 3.5 




1 1.5 2 2.5 3 3 5 4 



Figure 20: The mean fJ-sif, e) (left) and asymptotic variance e) (right) as a function of r under segregation 
with e = 0, V3/8, V3/4, 2 V3/7. 



Hodges-Lehmann asymptotic efhcacy (HLAE) ( Hodges and Lehmann ( 19561 )') is given by 



HLAE^(r,.) ^=M!4Z^ 
''Air, e) 

Rather than an arbitrary e we pick specific values: 5-\/3/24, -\/3/12, and ■\/3/21. Recall that pni 
degenerate as r oo. Furthermore Pnir) is degenerate when r = I. 

With e = 5 \/3/24, 



r is 




"6 



for 



^e[l,3) 

1^2_|^_^^-2^M for 
_55 j,-2 



L^-^ + 1 



^ ... re [3,4) 
for r G [4, oo) 



and 



where 



5 

(r, 5 V3/24) = ^ (r, 5 V3/24) I(Xj ; 



/ ^ , \ r'' - 2 + 1 
V2 (r,5 V3/24) = -[l20ri" 



V3. 



(r, 5 \/3/24) 
(r, 5 \/3/24) 

^ . 787 r4- 7601 r2- 16032 
.,,(r,5V3/24). ^ 



50304 + 58754 + 74880 - 248577 1 
23328 r- 7305 
120 r^" - 2176 + 15180 - 
+ 138240 - 70477 



2176 + 15340 - 50304 r' + 58754 r" 
138240 + 47172 + 23328 r- 7305 / 405 



48960 + 58754 + 47440 - 176547 j 

23328 i 



/ 



405 



10 r^^ - 192 r" + 1320 r^" 



i r - 7305 
2944 - 

- 23520 r+ 7315 



7590 + 49920 - 69986 ~ 46480 
184137 r'^ ~ 143360 + 71917 - 23520 r + 7315] / 



405 r*^ 



-9265 



The corresponding intervals are T\ = (1, 3), I2 = [3, 7/2), X3 = 
See Figure [22] for the graph of p,A {r, 5 a/3/24) and {r, 5 \/3/2 



7/2,2 + VS), X4 = [2 + ^3,4), I5 - [4,00). 



Then we get HLAE^ 5 ^3/24) ^ 
With e = V3/12, 



by substitutmg the relevant terms. See Figure [211 



12l= r isV--'' for re [1,2) 



and 



where 



1 for 7' e [2, oo) 

(r, a/3/12) =^^., (r,^/3/12) ) 



1^1 1 r, \/3/12 
(r,V3/ 



10 - 96 r" + 240 r^" + 192 - 1830 + 3360 - 2650 r'' + 240 r'^ + 1383 r'^ 



1280 r-^ + 540 - 144 r + 35 



405 r*^ 



1^2 



12 = 



10 r^^ - 96 r" + 240 r^" + 192 - 1670 + 2784 - 2650 + 2400 - 1047 



1^3 



12 = 



1280 + 1269 - 144 r + 35 

537 - 683 - 2448 r + 1315 
40576 



405 r'^ 



The corresponding intervals are Ii = [1,3/2), I2 ~ [3/2,2), X3 = [2, 00). See Figure [22] for the graph of 
^l (r, a/3/12) and v (r, \/3/12) . 



Then we get HLAE^ (r, V3/12) 
With e = \/3/21, 



(/x^(r,v/3/12)-^(r))^ 
1^^(^,73/12) 



by substituting the relevant terms. See Figure [211 



MA 



(r, V3/2I 



and 



16200 

2719 r''-5592r''+5760r^- 1536 
8100 

53r''+2744r''-7296r-'+8064 r-3104 
2700 

2719r''-1440r''+2112 
16200 

2401r''-73824 r^ + 153664 r-88548 
16200 r2 

1 _ 89 r--2 
54 



(r, V3/2lj = ^ (r, a/3/21 j I(X, ) 



for 


re [1,8/7) 


for 


re (8/7,3/2) 


for 


re (3/2,12/7) 


for 


re (12/7,7/4) 


for 


re (7/4,2) 


for 


r e [2,00) 



where 



1^1 



V2 



V3 



V4. 



.Vs/2l 



.V3/21 



,V3/21 



,V3/21 



,^3/21 



,\/3/21 



V3/21 



= 1^4124031 r^^ - 22708224 r" - 389826 + 369129408 - 1592672721 

+ 3532359672 - 4721848374 + 4050858048 - 2387433568 + 995033088 

~ 209048784 - 43352064 r + 25952256] / [65610000 r''] , 

= [6594660 r^^ - 31178952 - 14911074 + 441735648 - 1578842961 

+ 3311083512 r - 4669163574 r' + 4366966848 - 2522908768 + 778272768 

- 93443280 + 14450688 r - 8650752] / [65610000 r^] , 

= [826701 r^^ - 7118748 + 14155864 r^" + 18467640 - 104968680 
+ 165877272 r - 128355690 r" + 27338184 + 47304144 r"" - 52684800 
+ 24413592 - 7225344 r + 196608o] / [32805000 r*^] , 

= [826701 r^^ - 7118748 + 14155864 r^" + 18467640 + 20074008 r** 

- 671672808 r ' + 2194076310 - 3382581816 + 2840904144 - 1262284800 
+ 240413592 - 7225344 r + 196608o] / [32805000 r**] , 



= [826701 r^^ - 7118748 r^^ + 14155864 + 18467640 - 137116617 
1511673690 r' + 2773418184 - 



512952192 r - 1511673690 r' + 2773418184 - 2883095856 + 1560115200 
335586408 - 7225344 r + 196608o] / [32805000 r"^] , 



= [826701 r 

+ 125718912 r - 128697690 



- 7118748 



+ 14155864 + 18467640 - 

60695856 - 



91939401 

r-** + 139178184 - 



r + 1966080] / [32805000 r''] , 



52684800 ) 



+ 48413592 - 7225344 

= [226415 r^^ - 1426740 r" + 334536 + 17196648 - 



+ 311364480 r-^ - 711864862 r'^ + 944809880 - 



.. , 684036240 r'' + 

24048504 - 7633920 r + 4761088] / [l0935000 r''] , 



87678147 

238099456 



= [5786907 r-^^ - 42712488 + 76274888 + 51865788 r*^ - 300043296 r' 
+ 132202536 r** + 171413760 - 93614976 + 147517440 - 194460480 r 
+ 67608576 r - 2906112o] / [262440000 r'^j , 

= - [2470629 r-^^ - 25412184 r" + 112001848 r"' - 1958438076 
- 5449924256 + 6150612888 - 55820599296 + 109663683136 r* 



- 97335694848 + 40552466112 - 9825887232 r + 307852320o] / [262440000 r*] , 

433645 - 1765008 r + 929955 
455625 



_ 493829 r'^ - 



The corresponding intervals are Ji = (^l,2VT4/7j, I2 = [2 \/T4/7, 8/7j , I3 = [8/7,5/4), J4 = [5/4,4/3), I5 

[4/3,10/7), le = [10/7,3/2), Ij = [3/2,12/7), Is = [12/7,7/4), Xg = [7/4,2), = [2, 00). See Figure [22] for 
the graph of iia {r, •\/3/2l) and i/a {r, %/3/2l). 

Then we get HLAE^ (r, \/3/2l) = ^^'^^J^(^^^/2i))''''^ substituting the relevant terms. See Figure [211 

Notice that for e = 5 V3/24 and e = V3/12, argsup^>i HLAE'^(r, e) = 1. This result for HLAE can be 
generalized for arbitrary e as follows. 



Figure 21: Hodges-Lehmann asymptotic efficacy against association alternative as a function of r for 
e = a/3/21, 73/12, 5^3/24 (left to right). 




Proposition 2 Let r* := argsup^>i HLAE'^(r, e) and e > ^3/12. Then r* = 1. 

Proof: Recah that HLAE^(r,e) ■ For e G V3/12, VS/s), ^^^(r = l,e) ^ 1 and VA{r = 

1, e) ^ 0. Hence HLAE^(r, e) ^ oo as r ^ 1. So the desired result follows. ■ 

For e G 0, -\/3/12 , it seems that for a while r* = 1 with respect to HLAE. e.g. for e — \/3/21. But for 
sufficiently small e, r* > 1 holds. This can also be seen as e ^ in which case HLAE becomes PAE and the 
optimal value is about 1.006 with respect to PAE. Furthermore, observe that the argsup for HLAE gets closer 

to 1 as e and VAif, e) > for e G ^0, a/3/12^ and v{r, e) gets larger as e — > 0. 

Figure [21] contains a graph of HLAE against association as a function of r for e = 5 ^3/24, ^3/12, ^3/21. 
Notice that since v{r = 1, e) = for e > V3/12, HLAE^(r = 1, e) = oo for e > ^3/12 and linv^oo HLAE^(r, e) = 
0. In Figure [21] we see that, against i?^, HLAE"^(r, e) has a local supremum for some r > 1. Let n be 
the value at which this local supremum is attained. Then n (5 \/3/24) w 3.2323, n (a/3/12) w 1.5676, and 
r; (•\/3/2l) « 1.533. Note that, as e gets smaller, r gets smaller. Furthermore, HLAE"^ (r = 1, -\/3/2l) < oo and 
as e ^ 0, so r becomes the global supremum, and PAE^(r = 1) = and argsup^>]^ PAE^(r = 1) w 1.006. So 
HLAE suggests choosing moderate r when testing against association, whereas PAE suggests choosing small r. 

Derivation of ^aC*", e) and VA[r,e) for association with e = 5V3/24, \/3/12, and a/3/21 are similar to — 
with the supports being the complements of — the corresponding segregation cases. 



In Figure [221 we plot the graphs of mean and asymptotic variance for r G [1,8] under association with 



e = 0, ■\/3/21, -\/3/12, 5 ■\/3/24. Notice that HAir, e) gets smaher as e gets larger at each r which is in agreement 
with the /i^(r, e) expressions in Appendix 2. However, the same ordering does not hold for :^^(r, e) at each 
For small r the ordering is same as in /iyi(r, e), but for large r the ordering is reversed. Furthermore, 
supj.g[]^ i^Air, e) seems to decrease as e increases while argsup^gjj^ i^Air, e) seems to increase as e increases. 



3.9 Asymptotic Power Function Analysis 



The asymptotic power function (see, e.g., Kendall and StuartI 1 1979f l) can also be investigated as a function of 
r, n, and e using the asymptotic critical value and an appeal to normality. 



3.9.1 Asymptotic Power Function Analysis Under Segregation 

Under segregation, for sufficiently large n, we reject Hq when ^Jn ( P"*^'')~^s('-,e) \ ^ Z(i_a) where Z(i_a) is the 

(1 — a) X 100 percentile of the standard normal distribution, e.g. with a ~ .05, z.95 ~ 1.645. Then size a critical 
region for large samples is 

Pn{r) > p{r) + Z(i_a) ■ \Jv{r)ln. 
Under a specific segregation alternative Uf ^ the asymptotic power function is given by 



1 - $ 



\/vs{r, e) V'^S'(?', e) 



With e = ns(r, n, e) at level a = .05 is plotted in Figure [H Observe that ns(r, n, ^3/8) ^ as r 4 

for n = 5, 10, 15. Let r* (n, e) be the the value at which ns(r, n, e) attains its global supremum and r\ {n, e) be the 
the value at which Ils{r, n, e) attains its local supremum. Then r*(5, 73/8) 1.260, r;(10, V3/8) « 1.3741 and 
(10,^3/8) « 2.3818, r* (15,^3/8) « 3.3724 and r;*(15, ^3/8) « 1.45, r* (20, a/3/8) = 4 and r,*(20, V3/8) « 
1.5. Finally, r*g{n, a/3/8) = 4 for = 20, 50, 100 and Usir, n, a/3/8) has a hump for n = 10 and n = 15. 

With e = \/3/4, n5(r, n, e) at level a ^ .05 is plotted in Figure [24l Observe that ns(r, n, V3/4) ^ 1 as 
r ^ 2 for n = 3, 5. Moreover, r*{n, \/3/4) = 2, for n = 3, 5. 



as 



With e = 2 V3/7, ns(r, n, e) at level a = .05 is plotted in Figure UHl Observe that Iis{r, n, 2 \/3/7) 1 
r ^ 3/2 for n = 3, 5 and r*(n, 2 ^3/7) 2, for n = 3, 5. 

3.9.2 Asymptotic Power Function Analysis Under Association 

Under association, for sufficiently large n, we reject Hq when y/n j £iL(ltii4(l£) ) < where Za is the a x 100 

V yi^A(r,e) / 

percentile of the standard normal distribution, e.g. with a ~ .05, z.05 ~ —1.645. Then size a critical region for 
large samples is 

Pn{r) < p{r) + Za ■ \/v(r)ln. 



Under , we have 



nA(r, n, e) := p(^p„ {r) < p.{r) + Za ■ ^/iy{r)/'nj 



\/vA{r, e) \/vA{r, e) 



With e = \/3/21, n^(r, n, e) at level a = .05 is plotted in Figure^ Observe that U^ (r, n, \/3/2l) .057 
as r — > 00 for n = 5, 10, 100. Let r(n, e) be the value at which 11^ (r,n,e) attains its supremum. Then, 




Figure 24: Asymptotic power function against segregation alternative H^^^ as a function of r for — 3 (left) 
and n ~ 5 (right). 



Figure 25: Asymptotic power function against segregation alternative H^^^ as a function of r for n ~ 3 (left) 
and n = 5 (right). 



Figure 26: Asymptotic power function against association alternative as a function of r for n = 5, 10, 100 





Figure 27: Asymptotic power fimction against association alternative as a function of r for n = 5, 10, 100 



f(5,V3/2l) w 2.01, and f(l0,\/3/2l) « 1.875, and f (100,^3/21) w 1.645. Moreover, (r, 100, ^/3/2l) 
attains a local infimum at « 1.065. 

With e = V3/12, n^lr-, n, e) at level a = .05 is plotted in Figure[271 Observe that Ua {r, n, V3/12) .0766 
as r ^ oo for 71 = 5, 10, 100. Moreover, f (5, V3/12) w 1.99, f (lO, \/3/12) w 1.75, and f (lOO, %/3/12) w 1.60. 
Moreover, Ha (?', 100, \/3/2l) attains a local infimum at w 1.105. 

With e = 5 \/3/24, nA(r, n, e) at level a = .05 is plotted in Figure [281 Observe that 11 {r, n, 5 %/3/24) 1 
as r ^ 2 for = 5, 10. 



Figure 28: Asymptotic power function against association alternative -^^^3/34 ^ function of r for n = 5, 10. 



Figure 29: Realization of segregation (left), Hq (middle), and association (right) for |3^| = 10, J = 13, and 
n = 100. 



3.10 Multiple Triangle Case 



Suppose 3^ is a finite collection of points in M.'^ with |3^| > 3. Consider the Delaunay triangulation (assumed to 
exist) of y, where Tj denotes the j*'' Delaunay triangle. J denotes the number of triangles, and Cniy) denotes 
the convex hull of y. We wish to test 

Ho:Xi'^U{CH{y)) 
against segregation and association alternatives. 

Figure [29] and Figure [30] are graphs of realizations of n — 100 and n = 1000 observations which are 
independent and identically distributed according to U{CH{y)) for |3^| = 10 and J = 13 and realizations of 
n ~ 100 and n ~ 1000 observations under segregation and association for the same 3^, respectively. 




Figure 30: Realization of segregation (left), Hq (middle), and association (right) for |3^| = 10, J = 13, and 
n = 1000. 

The digraph D is constructed using Ny,{-) as described in Section [2731 here for Xi e Tj the three points 
in y defining the Delaunay triangle Tj are used as 3j . Let p„ (r, J) be the relative density of the digraph 
based on Xn and y which yields J Delaunay triangles, and let Wj = A(Tj)/A{CH{y)) for j = 1, . . . , J, where 
A{CH{y)) = ^(^i) with A{-) being the area functional. Then we obtain the following as a corollary to 

Theorem 2. 

Corollary 1 The asymptotic null distribution for p„(r, J) conditional on W = {wi, . . . , wj} for r E [1, oo) 
is given by N{^J,{r, J),v{r, J)/n) provided that i>{r, J) > with 

J J 
fi{r, J) fi{r) ^ and iy{r, J) := iy{r) ^ Wj + 4^(r)^ 

where fj,{r) and iy{r) are given in Equations ([S]) and ([5]), respectively. 
Proof: See Appendix 5 for the proof ■ 



J 




(14) 



r 


1 


11/10 


6/5 


4/3 


^/2 


3/2 


2 


3 


5 


10 


n = 100, N = 1000 


as{n, J) 


.144 


.141 


.124 


.101 


.095 


.087 


.070 


.075 


.071 


.072 




.191 


.383 


.543 


.668 


.714 


.742 


.742 


.625 


.271 


.124 


J) 


.118 


.111 


.089 


.081 


.065 


.062 


.067 


.064 


.068 


.071 


/?^(r,V3/12, J) 


.231 


.295 


.356 


.338 


.269 


.209 


.148 


.095 


.113 


.167 


n ^ 200, N = 1000 


Ss(?T., J) 


.095 


.092 


.087 


.077 


.073 


.076 


.072 


.071 


.074 


.073 


/?~^(r,V3/8, J) 


.135 


.479 


.743 


.886 


.927 


.944 


.959 


.884 


.335 


.105 


aA(n, J) 


.071 


.071 


.062 


.057 


.055 


.047 


.038 


.035 


.036 


.040 


/?;^(r,^/3/12, J) 


.182 


.317 


.610 


.886 


.952 


.985 


.972 


.386 


.143 


.068 


n = 500, iV = 1000 


asin, J) 


.089 


.092 


.087 


.086 


.080 


.078 


.079 


.079 


.076 


.081 


/3'^(r,V3/8,J) 


.145 


.810 


.981 


.997 


.999 


1.000 


1.000 


1.000 


.604 


.130 




.087 


.085 


.076 


.075 


.073 


.075 


.072 


.067 


.066 


.061 


'0^{r,V3/12,J) 


.241 


.522 


.937 


1.000 


1.000 


1.000 


1.000 


.712 


.187 


.063 



Table 7: The empirical significance level and empirical power values under H^^ and H\- , N = 1000, 

v3/S y o I Yl 

n = 100, and J = 13, at a = .05 for the realization of y in Figure [501 



By an appropriate application of Jensen's inequality, we see that '^^j — 



Therefore, the 



covariance v(r, J) = iff both v{r) = and X]j=i = ' 
even when v{r) = 0. 



hold, so asymptotic normality may hold 



Similarly, for the segregation (association) alternatives with 4 e^/3x 100% of the triangles around the vertices 
of each triangle is forbidden (allowed), we obtain the above asymptotic distribution of /0„(r, J) with /x(r, J) being 
replaced by /i(r, J, e), v(r, J) by vir, J, e), /^(r) by /x(r, e), and v{r) by vir, e). 

Thus in the case of J > 1, we have a (conditional) test of Hq : Xi *~ U{CH{y)) which once again rejects 
against segregation for large values of p„(r, J) and rejects against association for small values of Pnir, J)- 

The segregation (with 5 — 1/16 i.e. e = \/3/8), null, and association (with (5 = 1/4 i.e. e = \/3/12) 
realizations (from left to right) are depicted in Figure [29] with n = 100 and in Figure [30l with n = 1000. For 
both n = 100 and n = 1000, for the null realization, the p-value is greater than 0.1 for all r values and both 
alternatives. For the segregation realization, with n = 100 we obtain p < 0.001 for r < 3, p = 0.025 for r = 5, 
and p > 0.1 for r > 10 and with n = 1000 we obtain p < 0.0031 for 1 < r < 5 and p > 0.24 for r = 1 and 
r > 10. For the association realization, with n = 100, we obtain p < 0.05 for r ~ 1.5, 2, and p > 0.06 for for 
other values of r and with n = 1000, we obtain p < 0.0135 for 1 < r < 3, p = .14 for r — 1, and p > 0.25 for for 
r > 5. Note that this is only for one realization of 

We implement the above described Monte Carlo experiment 1000 times with n = 100, n = 200, and n ~ 500 
and find the empirical significance levels as{n,J) and aAin, J) and the empirical powers /3,f(r,^/3/8, J) and 

(r, •\/3/12, J). These empirical estimates are presented in Table[7|and plotted in Figures [311 and [32l Notice 
that the empirical significance levels are all larger than .05 for both alternatives, so this test is liberal in rejecting 
Hq against both alternatives for the given realization of y and n values. The smallest empirical significance 
levels and highest empirical power estimates occur at moderate r values (r = 3/2, 2, 3) against segregation and 
at smaller r values (r ~ ^/2, 3/2) against association. Based on this analysis, for the given realization of y, we 
suggest the use of moderate r values for segregation and slightly smaller for association. Notice also that as n 
increases, the empirical power estimates gets larger for both alternatives. 

Remark The conditional test presented here is appropriate when Wj G W are fixed, not random. An 
unconditional version requires the joint distribution of the number and relative size of Del aunay triang l es wh en 
y is, for instance, a Poisson point pattern. Alas, this joint distribution is not available fsee [Okabe et al.l 

□ 



Figure 31: Monte Carlo power using the asymptotic critical value against H^^^, as a function of r, for n ~ 100 

(left), n ~ 200 (middle), and n = 500 (right) conditional on the realization of y in Figure [551 The circles 
represent the empirical significance levels while triangles represent the empirical power values. 




Figure 32: Monte Carlo power using the asymptotic critical value against H^^^^ as a function of r, for 

n = 100 (left), n ~ 200 (middle), and n ~ 500 (right) conditional on the realization of y in Figure [501 The 
circles represent the empirical significance levels while triangles represent the empirical power values. 



3.10.1 Related Test Statistics in Multiple Triangle Case 



For J > 1, we have derived the asymptotic distribution of J) = („ (^i)) ■ ^et Aj be the number of arcs 
and prij (r) be the relative density for triangle Tj and nj :~ \Xn H Tj|, for j = 1, . . . , J. So 



nj (»j - 1) 
^ n (n — 1) 

7 = 1 ^ ^ 



E 



Pn, W = Pnir, J), 



smce 



^-^ n{n — \) 

.7 = 1 ^ ' 



\A\ 



n (n — 1) n (n — 1) 



Pn{r, J). 



Let [/„ 



p„^(r) where = A(rj)/A(Ci/(3^)). Since Puji''') asymptotically independent, 
(f^n ^ >^)) sind {Pn{r, J) — p{r, J)) both converge in distribution to Af{0, v{r, J)). 

In the denominator of pn{r,J), we use n{n — 1) as the maximum number of arcs possible. However, by 
definition, we can at most have a digraph with J complete symmetric components of order Uj, for j = 1, . . . , J. 
Then the maximum number possible is nt := J2j=i the (adjusted) relative density is p'^j{r) ^ 

and p-*(r) = ^J:^ = E/=i ^^^^^ Pn, (r). Since ^^^^^ > for each j, and 



"j (nj-l) _ -, adj 



is a mixture of pn.ir)- Then E p'^'^j{r) = /^(r, J) and the asymptotic variance of Pn^'jir) is 



adj 



1, 




4mmME-?/ E-l -1 



.□ 



3.10.2 Asymptotic Efficacy Analysis for J > 1 

The PAE, HLAE, and asymptotic power function analysis are given for J = 1. For J > 1, the analysis will 
depend on both the number of triangles as well as the relative sizes of the triangles. So the optimal r values 
with respect to these efficacy criteria for J = 1 do not necessarily hold for J > 1, so the analysis need to be 
updated, given the values of J and W. 

Under segregation alternative Hf , the PAE is given by 

2 



Under association alternative Hf the PAE is similar. 

In Figure [33l we present the PAE as a function of r for both segregation and association conditional on the 
realization of y in Figure [30l Notice that, unlike J = 1 case, PAE5(r) is bounded. Some values of interest are 

PAEf(p„(l)) = .3884, lim,,^^PAEf(r) = ^^.y^ « 139.34, argsup,g[i,2] PAE5(r) « 1.974. 

As for association, PAE;^(r = 1) = 422.9551, lim,.„,oo PAE;^(r) = 0, argsup^>i PAE;^(r) « 1.5 with PAE;;^(r = 
1.5) « 1855.9672. Based on the asymptotic efficacy analysis, we suggest, for large n and small e, choosing 
moderate r for testing against segregation and association. 

Under segregation, the HLAE is given by 

HLAEf(.,.,:=1^5J(ii#^%2)! = (>'^(^.')(£:^."^)-;'W(£t.''^))' 




Figure 33: Pitman asymptotic efficacy against segregation (left) and association (riglit) as a function of r witli 
tlie realization of y in Figure [29] Notice tliat vertical axes are differently scaled. 
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r r r 

Figure 34: Hodges-Lehmann asymptotic efficacy against segregation alternative Hf as a function of r for 
e = \/3/8, \/3/4, 2 \/3/7 (left to right) conditional on the realization of y in Figure [29l 

Notice that HLAE5(r, e = 0) = and lim_oo HLAE5(r, e) = 0. 

We calculate HLAE of Pn{f^ J) under for e = a/S/S, e = "\/3/4, and e = 2 \/3/7. In Figure[34]we present 
HL AE J (r, e) for these e values conditional on the realization of y in Figure [29l 

Note that with e = V3/8, HLAEf (r = 1, ^/3/8) w .0004 and argsup^gf^^] HLAE^ (r, ^/3/8) « 1.8928 with 
the suprcmum w .0544. With e = \/3/4, HLAEf (r = 1, V3/4) w .0450 and argsup^g^i^^j HLAE^ (r, V3/4) w 
1.3746 with the supremum « .6416. With e = 2 V^/7, HLAE^ (r = 1, 2 V3/7) « .045 and argsup^g[i.„„] 
HLAEf (r, 2 ^3/7) « 1.3288 with the supremum « .9844. Furthermore, we observe that HLAEf (r, 2 V3/7) > 
HLAE^ (r, V3/4) > HLAEf (r, V3/8) at each r. Based on the HLAE analysis for the given y we suggest 
moderate r values for moderate segregation, and small r values for severe segregation. 

The exphcit form of HLAE j(r, e) is similar which implies HLAEj (r, e = 0) = and limr_»oo HLAEj (r, e) = 

0. 

We calculate HLAE of p„(r, J) under for e = ^3/21, e = \/3/12, and e = 5 \/3/24. In Figure [35] we 
present HLAE5(?', e) for these e values conditional on the realization of y in Figure [30] 

Note that with e = %/3/21, HLAE;^ (r = 1, V3/21) w .0009 and argsup^gf^ HLAE;^ (r, V3/21) w 1.5734 
with the supremum sa .0157. With e = \/3/12, HLAEj (r = 1, V3/12) « .0168 and argsup^gfi^j 
HLAE;^ (r, V3/12) « 1.6732 with the supremum « .1818. With e = 5 V3/24, HLAE;^ = 1, 5 V^/24) « .0017 
and argsup^gM HLAE;^ (r, 5 ^3/24) w 3.2396 with the supremum w 5.7616. Furthermore, we observe that 
HLAE;^ (r, 5 \/3/24) > HLAE^^ (r, \/3/12) > HLAE;^ (r, %/3/2l) at each r. Based on the HLAE analysis for 



Figure 35: Hodges-Lehmann asymptotic efficacy against association alternative as a function of r for 
e = \/3/2f , V3/f 2, 5 V3/24 (left to right) conditional on the reahzation of y in Figure!^ 



the given y we suggest moderate r values for moderate association and larger r values for severe association. 



4 Discussion 



The extension to R'' for d > 2 is straightforward. See ICevhan and Prie bi (|2003bl) for more detail. Moreover, 
the geometry invariance, asymptotic normality of the {/-statistic and consistency of the tests hold for d > 2. 

The first proximity map similar to the r-factor proximity map Ny in literature is the spherical proximity 
map Ns{x) := B{x,r{x)), (see the references for CCCD in the Introduction). A slight variation of Ns is the 
arc-slice proximity map N Afi(x) := B{x,r{x)) D T{x) where T{x) is the Delaunay cell that contains x (see 
Cevhan and Priebe ( 2003a )) . Furth ermore, Ce yhan and Priebe introduced the central similarity proximity map 
Ncs in Cevhan and Prieba ( 2003a ) and Ny in lCevhan and Priebe ( 20033) ■ The r-factor proximity map, when 
compared to the others, has th e advantages that the asymptotic distribution of the domination number ^n{Ny) 



is tractable (see lCevhan and Priebe (^2003bi)), the exact minimum dominating sets can be found in polynomial 



time. Moreover Ny and Ncs are geometry invariant for uniform data over triangles. Additionally, the mean 
and variance of p„ is not analytically tractable for Ns and Nas- While Ny{x), Nc's{x), and Nas{x) are well 
defined only for x G C^f (3^), the convex huU of y, Ns{x) is well defined for aU x G M''. Ng and Nas require no 
effort to extend to higher dimensions. 



There are many tests available for segregation and association in literature. See iDixonI (jl994f ) for a survey 
on these tests and relevant references. The most prevalent of these tests are Pielou's test of independence 
and Ripley's test based on K{t) and L{t) functions. However, the test we introduce here is not comparable to 
either of them, since it is a conditional test — conditional on a realization of J = |3^| and W and we require 
the number of triangles J is fixed and relatively small compared to n = \Xn\- The null hypothesis for testing 
spatial patterns has two major forms: 



(i) assuming random labeling of locations, i.e. spatial randomness does not necessarily hold, as in Pielou's 
test which only tests for the association between classes, 

(ii) assuming not only random labeling but also complete spatial randomness, that is, each class is distributed 
randomly throughout the area of interest, as in Ripley's test. 



Our conditional test is closer to the latter in this regard. 

The test based on the mean domination number in ICevhan and Priebe (l2003bh is not a conditional test, 
but requires both n and number of Delaunay triangles J to be large. The comparison for a large but fixed J is 
possible. Furthermore, under segregation alternatives, the Pitman asymptotic efficacy is not applicable to the 
mean domination number case, however, for large n and J we suggest the use of it over arc density since for 
each e > 0, Hodges-Lehmann asymptotic efficacy is unbounded for the mean domination number case, while 
it is bounded for arc density case with J > 1. As for the association alternative, HLAE suggests moderate 
r values which has finite Hodges-Lehmann asymptotic efficacy. So again, for large J and n mean domination 



number is preferable. The basic advantage of p„(r-) is that, it does not require J to be large, so for small J it 
is preferable. 
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Appendix 1: Derivation of /i(r) and z/(r) 

In the standard equilateral triangle, let yi = (0, 0), y2 = (1, 0), ys = (1/2, \/3/2), Mc be the center of mass, Mj 
be the midpoints of the edges ej for j = 1, 2, 3. Then Mc = (1/2, %/3/6), Mi ^ (3/4, V3/4), M2 = (1/4, V3/4), 
M3 = (1/2,0). 

Recah that E[p„(r)] = ^^J2J:,<, E^] = iE[/ji2] = /i(r) = P{X, e iVJ(X,)). 

Let Xn be a random sample of size n from U {T{y)). For xi = {u, v), ir{xi) = r v + r \/3u — ^/Sx. Next, let 
Ni := ir{xi) n 63 and N2 := irixi) D 62- Then for zi e := T(yi, M3, Mc), iVj}(zi) = T(yi, iVi, iVz) provided 
that ir{xi) is not outside of T{y), where 

Ni = (r {yi + V3xi)V3/3,0) and N2 - (r (yi + VSxi) ^3/6, {yi + x/3xi)r/2). 



Derivation of yu(r) in Theorem 2 

Now we find /i(r) for r G [1, 00). Observe that, by symmetry, 

^lir) = p(X2 e iVj(Xi)) = 6P(X2 e 7Vj(Xi),Xi g r,). 

Let is{r,x) be the line such that r d{yi,is{r,x)) ~ d{\/i,ei), so £s{r,x) = \/3(l/r — x). Then if .xi e Tg is above 
4(?',a;) then iVJ(xi) = T(3^), otherwise, iV;5)(xi) C T{y). 

For r G [1, 3/2), £,{r, x)r\T, = 0, so N^{x) C T(3^) for aU x e T^. Then 

r^-^A(iV^(xi)),^.,^ __ , ^ 37 ,\ _ 37 



^l{r) ^ 6 P{X2 e iVJ(Xi), e T,) = 6 ^-^^dydx = 6 

where A{N^{xi)) ^ {y + V^xf and A(r(3^)) = ^3/4. 



1296 / 216 



For r e [3/2, 2), £s{,r, x) crosses through M^M(j. Let the x coordinate of ^s(r, x)C\yiM(j be si, then si = 
See Figure [551 



Then 



N r^/^^(^v(a;i)) /"^/^ A(iV;;(a;i)) 

1/2 /.x/ys ^ -36 + r4 + 64r-32r2 



^ J,^(r,.)A{T{y)) 48 r2 

Hence for r e [3/2,2), ^(r) = -ir^ - Sr"! + | + 4. 



For r e [2, 00), i's(?', x) crosses through yiA/3. Let the x coordinate of isir, x) C\ yiA/3 be S2, then S2 = l/f 
See Figure [36l 



Then 

'•^i Wy3^(^VJ(xi)) p ^^.(r,x) A{N^{xi)) 

^(T(3;))^^^^^ + ^ y„ A(r(3;)) 



p(X2GivMx,),x,eT.)= / / ^p^dydx+ / ;,/2^ ^^/^^ 



5/73 ^1/2 ^2:/y3 -[^ 



-3 + 


2r2 
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Derivation of z/(r) in Theorem 2 

To find Cov[/ii2, /113], we introduce a related concept. 

Definition: Let {il, A1) be a measurable space and consider the proximity map N : ^Ix p(ri) p{^), where 
p(-) represents the power set functional. For B C il, the Ti-region, Ti{-) ~ Fi(-, A^) : 51 — > p{fl) associates the 
region Fi(i3) := {z € : B C N{z)} with each set B C For x e fi, we denote Fi({a;}) as Fi(x). Note that 
Fi-region depends on proximity region N(-). 

Furthermore, let ri{-, Ny) be the Fi-region associated with Ny{-), let Aij be the event that {XiXj e A] = 
{X, e Arj(X,)}, then % = J(A„) + /(A,,)- Let 

P2V ^^({^2,^3} C iVJ(Xi)), P;, := P(X2 G ^J(Xi),X3 G Fi(Xi,iVJ), PJ^ - ^({^2,^3} C Fi(Xi,7VJ)). 
Then Cov[/ii2, /iia] = E[/ii2 /113] - E[/ii2]E[ft,i3] where 

E[/li2/ll3] = E[(I(Ai2)+I(yl2l))(I(Ai3)+I(A3l)] 

= P(Ai2 n A13) + P(Ai2 n A31) + P(A2i n A13) + p(A2i n A31). 

= p({X2,X3} c 7Vj(Xi)) + 2P(X2 e iVJ(Xi),X3 e Fi(Xi,iVj)) + P({X2,X3} c Fi(Xi,7Vj)) 

= ^2Af + 2 Pm + -PJg- 
So :/(r) = Cov[/ii2, /113] = (PJat + 2 PJ, + PJ^) - [2 

Furthermore, for any xi = [u,v) G T{y), Ti{xi,Ny) is a convex or nonconvex polygon. Let ^j(r, be the 
line between xi and the vertex parallel to the edge Cj such that r d{yj,^j{r, x)) = d{yj,£r{xi)) for j = 1, 2, 3. 
Then Ti{xi,Ny) n P(yj) is bounded by ^j{r, x) and the median lines. 

For xi = {u, v), ^i(r, x) = —\/3x + {v + ^/3u)/r, ^2ir, x) = {v + VSr {x — 1) + \/3(l — u))/r and ^^{r, x) = 
(\/3(r- 1) + 2w)/(2r). To find the covariance, we need to find the possible types of Fi {xi,Ny) and Ny{xi) 
for r G [1, 00). 

We partition [l,oo) with respect to the types of Ny{xi) and Fi {xi^Ny) and obtain [1,4/3), [4/3,3/2), 
[3/2,2), [2,cx)). 

For r G [1,4/3), there are six cases regarding Fi [xi^Ny^ and one case for Ny{xi). See Figure [571 for the 
prototypes of these six cases of Fi (a;i,7V^). Each case j, corresponds to the region Rj in Figure |38l where 



lam{x)=xly/l, gi(.T) = (2r + 3.T-3)/y3, 92(2;) = \/3 (1/2 - r/3), 

93(2:) = y3(x-l + r/2), q^{x) = ^/Z{\|2-r|^), 912(0;) = ^/3 (r/2 - cc) and si = 1 - 2 r/3, S2 = 3/2-r, 53 = 
1 — r/2, S4 = 3/2 — 5 r/6, S5 = 3 7-/8. The explicit forms of j = 1, . . . , 6 are as follows: 




Figure 37: The prototypes of the six cases of Fi {xi.Ny) for x\ £ Tg for r e [1,4/3). 



i?i = {{x,y) e [Q,si] X [0,4m (a;)] U [si,S2] x [qi{x),£am{x)]}, 

i?2 = {ix,y) € [si,S2] X [0,qi{x)] U [s2,S3] X [0,92(2:)] U [53,54] x [g3(x), 92(2;)]}, 

^3 = {(a:,J/) G [53,54] X [0, 93(2)] U [54, 1/2] x [0,92(2)]}, 

i?4 = {(a:,?;) e [51,52] X [0,91(0:)] U [54,55] X [93(2:), 4m (x)] U [55, 1/2] x [93(2), 912 (2)]}, 

i?5 = {ix,y) e [54,1/2] X [92(0:), 93(2;)]}, Re = {(a;,?;) e [55,1/2] x [912 (2), 4™ (2;)]}. 



By symmetry, PJ^ = 6P({X2,X3} C N^iX,), X, e T,] 
For r e [1,4/3), 



116640 



where ^(A^J(a;i)) = ^ (y + ^3 2)^. Hence for r G [1,4/3), P^V = '^m- ^^^"^ '^'^^'^ results also 

hold for r G [4/3,3/2). 




Figure 38: The regions corresponding to the prototypes of the six cases for r E [1, 4/3) with r = 1.25. 



Next, by symmetry, PJ^ = 6P({X2,X3} C Ti (Xi,iVj) , X, e T,), and 

6 

p{{X2,X3} c Ti (Xi,7Vj) , Xi e r,) = ^p({X2,X3} c Ti (Xi,ivj) , e r, 



For Xi G 



«i /■^<.'»(-)^(ri(xi,7Vj;))' 

A(ri (xi,iV];))^ _ (211 - 1716 r3 + 5751 r2- 6696 r + 2511)(2r- 3)2 



P({X2, X3} C Fi (Xi, iVJ) , e i?i) = -^^^Jl-^dydx 



's, J,,i.) A{T{y)r ''^^'^'^ 10935 r4 

where ^(Fi [xuN^y]) = ^ 

For a-i e i?2, 

^3 r^^^AiT,i.,,Nr)f^^^^^ p r'^^^t^dyd^ 



Is, Jo MTiyw " Js, J,,i-.) A{ny)Y 

(2 r - 3) (440 - 4091 + 13476 - 16506 r + 6696) 
972073 ' 

where A (Fi (a;i,iVJ)) = — ^ ^27^ -■ 

For xi e P3, 

P ({^2, X3} C Fi (Xi, iVJ) , Xi e P3) 

(2 r - 3)(21056 - 7845 r"* + 231300 - 943650 + 1127520 r - 428652) 

262440 r4 ■ 

1, /I /-n / ^rI■^^ (2 y^+2 ^3 y+3-6 a:+6 2-^-2 r^) 

where A (Fi (.Ti, AT^)) = ^ '-. 



For xi G i?4, 



12873091 2 81239 14714 4238 _2 656 128 77123 

= r H rH r r H r ' r . 

699840 648 27 9 3 3 216 

1 W-rn ^ Arr\\ ^3 (9 + 18- 24 r+4 ^3 r 18 x+6 a:^ + 14 + 12 r x-8 a; ^3 y-6 %/3 y ) 

where A [Ti {xi,N^)) = — -■ 

For a-i e i?5, 

.1/2 .93(x) ^('p ^r^^2 

P ({^2, ^3} c Fi (Xi, , e i?5) = / / MTnA^3 <^yd^ 

(89305 - 364080 + 598320 - 468288 r + 145152)(-6 + 5 rf 

262440 r4 " 

1 A r-^ r i^rrW \/3(9r^ + 18-24r+4\/3ry-18x+6a:^ + 14jj^ + 12rx-82:\/3y-6\/3y) 

where A [Ti [xi, N^)) = — j2^3 ^ . 

For xi e Re, 

^1/2 /■^o^(^) A(Fi (xi,jV^))" 



p({X2, Xs} c Fi (Xi, ,x,eRe) = J j \4(r(y))3 ^^^^ 

(1081 - 4672 + 7624 - 5568 r + 1536)(-4 + 3 rf 



960 r4 



where A {T, (xi,iVj)) = _ v^(v^i^-2^-^3+3.+4,-3.--3,-) 



2r^ 

So 



p({X2,X3}cFi(Xi,ivj;)) = 6 



/ 25687 2 133 14 , 1 2 1 4 1 

r ^ r ^ r r ^ 

V349920 972 81 9 90 324, 

25687 - 47880 - 1080 + 60480 - 38880 + 3888 



58320 r4 

Furthermore, by symmetry, =QP{X2 G iVJ(Xi), X3 G Fi (J*^i,A^3;) , G T,), and 



6 



P(X2 G N^{X^), Xs G Fi (XuN^) , Xi£T,)^Yl ^(^2 G iVJ (Xi), X3 G Fi (Xi, 7V^) , X^ G i?,). 



For Xi G i?i. 



,"1 /■^<""(^) A(Af{;(xi))^(Fi (a;i,Ar^)) 
P(X2 G N^{X,), X, G Fi {X,,N^) , G Pi) = / / ^(T(y))3 ^ ^'^^ 

2 -7/1/1 „ I c;cc\i'o„ o\4 



"'0 

A(iV^(xi))v4(Fi (xi,jVj;))^ _ 1 



^(r(3^))3 ^ 21870 



dydx = - — — — (143 - 744 r + 558) (2 r - 3) 



For xi G P2, 

r3 ^(iv^(.o)^(r (.,iv^)) J A(iVj;(.o)A(r.(...iv^)) 

= 7777^ r (2r - 3)(23014r'* - 187311 + 517896 - 594216 r + 241056). 
349920 



For xi e R3, 



P{X2 e N^iXi), Xs G Ti (Xi,iVj) , Xi G R3) 

1 



1049760 

For Xi £ i?4, 



(2 r - 3) (874 r - 297327 + 1858392 r'' - 4298832 r"" + 4280202 r - 1546209) . 



P{x, e N^{X,), Xs e Fi Xi, iVJ , x, e R,) = / IVTnms ^ ^/^^ 

■ r (1762560 r - 497664 - 2661120 + 201395 - 1017720 r'* + 2212560 r^). 



466560 
For xi e i?5, 

-dydx 



P{X2 e iVj(Xi), X3 e Fi (Xi,iVj) , x^eR^)^ / 



^ (1570/ - 1380 - 11205 + 29700 r - 19116) (-6 + 5 r)^ 



/"^aCx) A(jVj)(.Ti))A(Fi (xi,iV j))) 
AiT{y)f 



262440 
For xi G i?6j 

/•^- (^) A(iV^ (xi)) yl (Fi (xi, iV^)) 



A(r(y))3 



— (1485 r'' - 2064 + 16r^ - 128 r + 768)(-4 + 3r)^ 



51840 
Thus 

P{X,eNriX,),X,er, (Xi,iVj)) = 6 ( - J,r^ + + Ir^ _ if,. + ^) 

^ 3007 6 16 5 I 5 4,2 3 133 2 , 112 143 

116640 135 ^ 108 3 108 135 720 ' 

Hence 

E[hi2 his] = [3007 r^" - 13824 + 7743 + 77760 r'^ - 1 17953 + 48888 - 24246 r* + 60480 

- 38880 + 3888] / [58320 r"*] . 



Thus 



iy{r) 



3007 r^" - 13824 + 898 + 77760 - 117953 + 48888 r'^ - 24246 r'^ + 60480 - 38880 



-3888 



58320 



For r £ [4/3, 3/2), there are six cases regarding Fi (^xi,Ny) and one case for Ny{xi). Prototypes of the five 
of the cases for Fi (xi, Ny) are as in case— j for j = 1, . . . , 5 in Figure [37] and the new case, case-7, is depicted 
in Figure [39l Each case j corresponds to the region Rj in Figure l40l where si = 1 — 2 7-/3, S2 = 3/2 — r, S3 = 
1 — r/2, S4 = 3/2 — 5r/6, S5 = 3/2 — 3r/4. The exphcit forms of j = 1, 2, 3 are same as before, for j = 4, 5, 7 
are given below: 

Ri = {{X,y) e [S2,S4] X [q2{x),£am {x)]U[s4,S6] X [53(2;), ^am (a;)]} 

T R5 = {{x,y) e [S4, se] X [q2{x),q3{x)] U [se, 1/2] x [52(2;), 94(3;)]} 

Rr = {ix,y) G [s6,l/2] X [q^x) , £a,n (x)]} 




^ ^ 0.1 ' ' ' 0.2 ' St cTs^ ' ' O 'i c~ ' ^ 

Figure 40: The regions corresponding to the six cases for r e [4/3, 3/2) 



where 4m (x) = x/Vi, qi{x) = (2 r - 3)/V3 + V3x, = ^3 (1/2 - r/3), = V3 (x - 1 + r/2), and 

94(2;) = \/3 (1/2 -r/4). 

Then Pjtv ^ llilo ■ ^'^^ same Umits of integration in ^(r) calculations with the integrand A{Ny[xi)Y I A{T{y))'^ 

Next, by symmetry, PJ^ 6 P({X2, X3} C Ti (Xi, iVj) , Xi € T,) , and, let Sj := {1, 2, 3, 4, 5, 7}, then 

p({X2,X3} c Ti (Xi,7Vj) , Xi e T,) = ^ P({X2,X3} c Ti (Xi,iVj) , Xi e p,) . 

For xi e Pj, j = 1, 2, 3 we get the same result as before. 
For xi e P4, 

P({X2, X3} c Fi (Xi, , e P4) = / / ^^AJj_^dyd^ 



S6 /•^-"(^) A(Fi (a::i,iV{;))\ , 9637 - 89640 + 288360 - 362880 r + 155520 

— . , , — , , :; dyax = . 

«3(x) MT{y)Y 349920 r2 

, w-rn / ArrW ^3 (9 + 18- 24 r+4 ^3 r y- 18 x+6 aj^ + M y ^ + 12 r x-S a; ^3 y-6 ^3 y) 

where A (Fi (.Ti,iVJ)) = — ^ j^T^ -■ 



For xi e i?5, 



p ({X2, X3} c Ti (Xi, ivj) , e i?5) 
-6 A(ri (.Ti,7V5))" 



1/2 /-ww ^(ri(.Ti,ivj))' 



SB Jq2(x) 



A{T{y)f 



■dydx 



87251 + 13219200 r - 11214720 - 5225472 + 3377160 - 261288 



2099520 r3 



where A (Ti (xi, A'j)) is same as before. 
For xi e i?7, 



P {{X„ X,} c Fi (Xi, , X, e Rr) = ///^ 



1/2 (2;) A(r,{xuN-)f 



S6 Jqi{x) 

(57 r''+96 r^-72 r^-576 r+512)(~4+3 r)^ 
2880 r-4 



A(T(J^))3 



dydx 



where A (Fi (a;i,Af3;)) 
So, 



pr 



Furthermore, 



\/3 (6 1/^-2 \/3 iy+6-6 a:+6 r^) 

12 7-2 ■ 



/ -47880 - 38880 + 25687 - 1080 + 60480 + 3888 
I 349920 r4 ^ 

-47880 - 38880 + 25687 - 1080 + 60480 + 3888 

58320 r4 ' 



For a"! G = 1, 2, 3 we get the same result as before. 
For a-i e i?4, 

P {X2 e (Xi), X3 e Fi (Xi, , Xi G P4) 

..4 (X) A{N^{xi))A{Ti {xi,N^)) 



se rta 



A{T{y)f 



-dydx 



A{N-y{xi))A{T^ {x^,N-y)) 



Si J qzix) 



A{T{y)f 



dydx 



1 



466560 



(207360 + 404640 - 483840 r - 142920 + 1 7687 r" 



For xi e i?5, 

P (X2 e A^J (Xi), X3 G Fi (Xi, iVJ) , Xi e P5) 



S6 /•«3(a:) 



S4 J q2(x) 



A(r(3;))3 



1/2 



S6 Jq2{x) 



A{N^{xi))A{ri{xi,N^)) 



dydx 



■ (399064320 r - 150792192 + 171990000 - 391461120 - 31140648 + 1230359 1 

67184640 



For xi e P7, 



= 829340 ( 2 7 2 7 r-* - 3648 - 52736 + 166656 r - 121600)(-4 + 3 r)^. 



Then, 



Figure 41: The regions corresponding to the three cases for r E [3/2, 2) with r = 1.65 



So, 

E[/li2/ll3] 



5467 r^" - 37800 + 89292 + 46588 - 191520 + 13608 + 241920 - 155520 



15552 



233280 



Thus, for r G [4/3, 3/2) 



i/(r) 



5467 r^" - 37800 + 61912 + 46588 - 191520 + 13608 r'' + 241920 - 155520 



+ 15552 



233280 



For r e [3/2,2), there are three cases regarding Fi (a;i,iVj) and two cases for Ny{xi). The prototypes of 
these three cases as in cases 4,5, and 7 of Figures [571 and 15^1 Each case j, corresponds to the region Rj in Figure 
1411 where qj{x) are same as before for j = 3,4, and Sj, j = 3,4,6 are same as before and sy ~ 3/(4 r). Observe 
that for xi e i?4 U i?5 U i?7a, N^ixi) = Tr{xi) C T{y), and for xi e i?76, iV^^la^i) = T{y). So there are four 
regions to consider to calculate the covariance. 



Then, for 



Xi 



{x,y) G Rj, Ti (xi,Ny) are same as before for j = 4,5,7. The explicit forms of Rj, 



4, 5, 7a, 7b are given below (the explicit form of R^ is same as before): 

i?4 = {{X,lj) e [0,53] X [0,4m (x)] U [S3,se] X [q3{x),ea7n (x)]} 

R5 = {{x, y) e [S3, se] X [0, q^ix)] U [sg, 1/2] x [0, g4(x)]} 
^7,Q = {(a;,y) € [s6,S7] X [qi{x),£ani {x)] U [s7, 1/2] X [qi{x),ls{r,x)]] 
R7,b = {(a;, y) e [57, 1/2] x [4(r, a;), 4™ (x)]} 



Now, 



p({x„x3}civi(xo,x.eT.)^ r r^^^'^^i^^.^- r r^'^^ i^ity ^.^^ 



A(r(^))3 ' J, J, AiTiy)r 

/■'^-W A(iVJ(xi)f fi/^ ^^.^ (^) 1 -480 + r'^ + 768r- 320r^ 

^ io A(r(J))3 '^^^'^- + 7^^ X^^^^ A(r(^))''^""- 480r2 



Next, by symmetry, PJ^ = 6 ^({^2, ^3} C Ti {Xi,N^) , Xi e Ts) , and, let Sj := {4, 5, 7}, then 
P{{X2,X3} C Ti {Xi,N^) , XieT,)=J2 P{{X2,X3} C Ti (Xi,iV^) , Xi e Rj). 

= Si 



For xi G i?4 



P ({^2, X3} c Fi (Xi, , X, e i?4) = / / A{T{y)r '^^'^'^ 



Jo 



p Z-^- ^ (Fi (xi,iV^))' (237 r-^- 956 r3 + 17287-'- 1584 r + 592)(-2 + r)'^ 

/s3 X3(.) ^(r(3^))3 ^ 480 r4 



For xi S i?5, 



P{{X2,X,] C Fi (Xi,iVj) ,X,eR,)= / ^^.^^^dydx 



S3 "'0 



A{T{y)f 



1/2 |-94(:i;) (a;i,Afj)) _ (r- 2) (1909 r^- 6142 r'* + 10036 r^- 14808 r2 + 15024 r- 6048) 



Jo AinyW " 2880 r 

For xi e i?7 the result is same as before. So 



4 



P2G = 6 



7320 - 984 r'"^ + 13 - 20480 + 27840 r' - 18816 r + 5152\ 

144074 ) 



7320 - 984 + 13 r^^ - 20480 + 27840 r' - 18816 r + 5152 



240 r4 

Furthermore, 



P {X2 e Ny(Xi), X3 e Fi [Xi,Ny) , Xi e T,) = ^ ^^^^^^-^3 —dydx 



A{N^{xi))A{Ti{xi,N^)) 



For xi e i?4. 



P(X2 e iVj(Xi), e Fi (Xi, ivj) , x^ e P4) = / / A(r(y))3 '^^^^ 



"'0 



S6 /•^-"(^) A(Ar{;(xi))A(Fi (a;i,7VT,)) 1 . , w ^4 



For xi e i?i 



■5, 



P (^2 e iVJ(Xi), X3 e Fi (Xi, iVJ) , Xi e P5) 

/-^sC.) A(7V5(xi)) A (Fi ^ f^'^ r'"^ A(jV^(xi))A(Fi 



.3 Jo A{T{y)f " A, y„ A{T{y)f 

^ (-2 + r)(7535r^ - 35210 r'* + 9500 + 181560 r' - 308880 r + 147168). 



92160 
For xi e Rja, 

n f'^-"'^''^ A(NZixi))A(Ti(xi,N^)) 

P {X2 G N^yiXr), X3 G Fi {X^.N^y) , X, e Rj^) = / / 1!t-A;^\3 '^^^^ 



A(r(3^))3 
53 

dydx = """ r" ^ r"* + - 



/■^^(") A(iV^(xi))^(Fi(a;i,iVj;))^ ^ 303 g 91 5 53 4 ^ 235 3 173^2 



^(r(3^))3 10240 768 128 72 24 

101 2 , 3 . 16 3 , 4 18 

r + -r^^ r ^ r 3+47--* 

15 3 4 9 5 



J J qi(x) 




303 6 


91 


10240 


768 




34 




15' 



For xi G i?7b, 



p {X2 e N^ix,), X3 e Ti (Xi, , Xi e Rn) = 

ri/2 r^.^i-} A{N^{x,))A{r,{xi,N^)) (2 - 4r2 + 4r - 3)(2r - 3)^ 



A{T{y)f 



12 r6 



Hence 



7 r^^ - 72 r" + 240 r^" - 1440 + 3456 - 10296 + 15360 + 6720 - 40960 



46080 r^- 27648 r + 8640 



11520 r*^ 



7 r^^ - 72 r" + 240 r^" - 1440 + 3456 



Thus 

E[/ii2 /113] 



10296 + 15360 + 6720 r'' - 40960 r"^ + 46080 - 27648 r + 8640 



7 r^^ - 72 r" + 252 r^" - 1492 + 7392 - 43416 + 106496 - 110400 



1920 



34304 + 25472 - 27648 r + 8640 



960 r'' 



Therefore, for r e [3/2, 2) 



7 r^^ _ 72 + 312 r^^ - 5332 + 15072 + 13704 - 139264 + 273600 - 242176 



103232 r^- 27648 r + 8640 



960 r'' 



For r e [2,00), there is only one case regarding Fi {xi^Ny), namely i?7, and two cases regarding Ny{xi). 
Furthermore, 57, is same as before and ss = I/''- Observe that for xi S i?7a, Ny{xi) = Tr{xi) C T{y), and for 
xi £ i?7fc, Ny{xi) = T{y). So there are two regions to consider to calculate the covariance. 







X / 




/ ^^^^^^^^^ 




/ ^^^^ 




lltC — . . , , , , . , , , , , , , 





Figure 42: The regions corresponding to the two cases for Ny{xi) for r G [2,00) with r = 2.5 

For xi — {x,y) G i?7, Fi (^xi,Ny) is same as before. The explicit form of i?7, is same as Tg. For Rja and 
i?7b, see below: 

R7,a = {ix,y) e [0,57] X [0,4m (x)] U [57,^8] X [0,£^(r,x)]} 

Rtm {(a;,j/) e [s7,S8] X x), 4m (a;)] U [ss, 1/2] x [0, 4m (r, x)]} 



Now, 



fss r'^a^(x) ^ ^1/2 ^ 1 2 1 



Ju.) A{T{y)) ' 4 Jo ^(7^(3^)) 3 6- 

Hence PJ^ = 1 - 2r-2. Next, 

P({X2,X3} C Ti (Xi,iVJ) , Xi e T,) = P({X2,X3} C Ti (Xi,iVJ) , Xi e P7) 
1/2 .£_MA(ri(xi,iVj))' , 34- 45r2 + 15r4 



^(r(3^))3 ^ • 90 r 



4 



Q„ pr _ 34-45 r^ + lSr' 



Furthermore, 

P(X2 e 7V3;(Xi), X3 e Ti (Xi,7Vj) ,Xi e Ts) = Jp' — A(T{y)r — —dydx 

— Jo Jo A{T{y))3 "y"'^ + Jsr Jo A(T(3;))3 "y"-^ 

r.S8 (a;) ^(ri(:ri,jV^)) .1/2 .C™ A(ri .AT^) ) _ 25-48 r+90 r^-90 r^+30 r"^ 

+ JsrJe,{x) A(T(y}y^ — (^y(^^ + Js8 Jo A{T{y)y^ — ayax ysoT^ ■ 



c„ pr _ 25-48 r+90 r^-90 r-*+30 r° 
~ 307B ■ 

Hence, E[hu /^is] = 60r°-i65rYm.^-48r+25 ^ Thus, for r e [2,oo), 

15r'*-llr2-48r + 25 

"("^ = 157^ ■ 

For r = 00, it is trivial to see that i>{r) ~ 0. 



Appendix 2: The Mean fi{r,e) Under Segregation and Association 
Alternatives 

Derivation of ^{r,e) involves detailed geometric calculations and partitioning of the space of (r, e,2:i) for r G 
[1, cx)), e S [O, v^/3), and xi £Ts = Ts Sec Appendix 3 for the derivation of /i(r, e) at a demonstrative interval. 

/i5(r, e) Under Segregation Alternatives 

Under segregation, we compute /i5(r, e) explicitly. For e G [O, V3/8), e) = Sj=i '^ijC'": ^) ^ ^j) where 
wi,i(r, e) = 



STBr^e* - 1152 6"* - 37 + 288 



216(2e + l)2(2e- 1)2 

i^i,2(r,e) = - 1^576 r'^e* - 1152 r^e'' + 91 + 512 \/3r^e + 2592 r^e^ + 1536 \/3r- + 1152 

- 768 - 2304 e - 6912 r e'^ - 2304 \/3 + 1728 + 3456 \/3r- e + 5184 

- 1728 r - 1728\/3e + 648j/[216r^(2e+ l)^(2e - 1)^ 



roi.3(r,e) = -[l92r*e* - 384 r^e" + 9 r" + 864 r^e^ + 512 r + 384 e* - 2304 re^ - 768 

- 288 + 1728 + 576 r - 324 ] / ^72 (2 e + 1)^(2 e - 1)^] , 
ti7i,4(r, e) = - [l92 e^* - 384 r'^e^ - 9 r'* - 96 ^r^e + 288 r^e^ - 128 + 144 + 576 ^r^e + 256 
73 - 720 - 1152 e - 576 + 1152 r + 768 \/3 e - 612j / ^72 7-^(2 e + 1)^(2 e - 1)^] , 
48r''e* - 96r^e'* + 72 r^e^ - 32e'* + 64 V^e^ - 18r^ - 144 6^+27 



^i,7{r,e) = 1, 



18r2(2e + l)2(2e-l)2 

'e'' + 256 r^* - 128 ^r^e^ + 288 r^e* - 192 ^r^e^ + 72 r^e^ + 18 + 48 ^3 e - 45 

18(2e + 1)2(2£- l)2r2 ' 



with the corresponding intervals Xi = 



1,3/2 - V3e), X2 = 3/2 - V3e,3/2), X3 = 3/2, 2 - 4 e/\/3) , 



14 = 



2-4 e/x/3,2),X5 = 2, ^3/(2 e) - 1 ) , Xg = ^3/(2 e) - 1, \/3/(2 e) ) , and X7 = V3/(2e 



For e e 



V3/8, V3/6j , fj.s{r, e) = Z]J=i ^2,j(?', e) e 2^-) where n72j(r, e) = n7ij(r, e) for j = 1, 2,4, 5, 6, 



and for j = 3, 7, 



576 r'^e^ - 1152 r^e'' + 37r'' + 224 VSr^ e + 864 r^e^ - 384 e'' - 336 - 576 VSr^ e 



+ 768 V3e3 + 432 - 1728 + 576 ^/3 e - 216 
^2j{r,e) = 1, 



/ 216r2(2e + l)2(2e-l)^ 



1,3/2- x/3e), X2 - 3/2-^/3e,2-4e/^/3),X; 



with the corresponding intervals Ti 

X4 = [3/2,2), X5 = [2,\/3/(2e) - 1), Xe = \/3/(2 e) - 1, \/3/(2 e)) , and X5 = \/3/(2e),oo 



2-4 e 



/%/3,3/2), 



For e e 



\/3/6, \/3/4) , /xs(r, e) = Ti73,j(r, e) I(r € Ij) where 1^3,1 (r, e) = 1371,2 (r,e) and 



t^73,2(r,e) = -[576r*e** - 1152 r^e* + 37r* + 224 %/3r^e + 864 r^e^ - 384 - 336r^ - 576\/3r^£ 
+ 768 \/3 + 432 - 1728 + 576 \/3 e - 216] / [2I6 r^(2 e + 1)^(2 e - 1)^] , 

ti73,3(r,e) = [576 r^e* + 3072 re" - 1536 V3r + 3456 e** - 2304 \/3e^ - 37r^ - 224 \/3r e 
+ 864 + 336 r + 576 \/3 e - 432] / [2I6 (2 e + 1)^(2 e - 1)^] , 



ii73,4(r,e) = 1^192 r'*e* + 1024 r^e* - 512 ^r^e^ + 1152 r^e"* - 768 ^r^e^ + 9r'' + 96 \/3 r^e + 288 r^e^ 

- 144 - 576 V3r^e + 720r^ + 1152^/3^e - 1152 r - 576 \/3e + 54o] / ^72 r-^(2 e + 1)^(2 e - 1)^] , 
48 e"* + 256 - 128 ^3 r^'e^ + 288 r^e"* - 192 ^/3 r^e^ + 72 r^e^ + 18 + 48 \/3 e - 45 



18r2(2e + l)2(2e-l)2 



^^3,6('', e) = 1, 

with the corresponding intervals Xi = 1, 2-4 e/\/3^ , X2 = [2-4 e/V3, %/3/(2e)-l), X3 = [V3/(2 e) - 1, 3/2^ , 
X4 = [3/2, 2), X5 = [2,V3/(2e)), and X5 = [V3/(2e),oo 



For e € 



\/3/4, \/3/3j , ^s('', e) = ^4,i(?', e) l{r £ Ij) where 

Or^e^ + 2x/3r2e + 48re2 + - 16 V3re- 90 6^ - 12r + 36 V3e 



18 (3e- V3) 

ro4,2(r,e) = -[Or^e* -4 VSr^ + 48 r'^e'' - 48 \/3 r^e^ - 90 r^e'' + 36 r^e^ + 96 V3 r^e^ - 126 r^e^ 
- 32 V3 r - 48 e V 36 V3 r^e + 144 r + 96 VS - 18 - 72 V3 r e - 216 + 36 r 
+ 72\/3e-27 / 2 (^3 e - \/3) , 



with the corresponding intervals Xi 



1,3- 2e/\/3), X2 = 3 - 2e/V3,y3/e- 2), andXa = V3/e-2,oo 



/iyi(r, e) Under Association Alternatives 

Under association, we compute fJ,A{r, e) exphcitly. For e e [O, (7 V3 — 3 vTs) /12 w .042), e) = Sj=i ^ijl*"! ^) ^ 
X, ) where 

^i7i,i(r,e) = - [3456 e^r-'' + 9216 e^r^ - 3072 ^eV"* - 17280 e'^r^ - 3072 \/3eV^ + 2304 eV* 

+ 4608^eV^ - 2304 + 6336 e"* + 6144 r + 6912 eV^ + 512 ^er^ 

- 101 r"* - 6144^/3e^ - 11520 - 1536 ^er^ + 256 + 5760 + 1536%/3er 

- 384 r^-512\/3e + 256r--64j/[24 (^6e+^)^ (^6e-^^%^], 

ti7i,2(r,e) = -[l728e^r-^ - 1536 V3eV^ - 31104 e^r^ + 1152 eV"* + 15552 e"* + 10368 eV^ - 37 

- 20736 r + 10368 e^] / [24 (g e + Vsj^ (& e - r^] , 

^i.3{r, e) = [-2592 e^r" - 2304 %/3eV - 46656 e*r^ + 1728 e^r* + 10656 - 9216 VSe^ r 
+ 9072 eV^ - 432 \/3 e - 15 + 12288 VSe^ - 13824 e V + 1728 ^ er ~ 216r 
+ 4032e^- 2304 \/3 e r + 432 + 1024 \/3 e - 384 r + 128] / [36 (ee + VsY (de- VsJ^ r^j, 
1728 e^r^ - 1536 \/3eV^ - 31104 e^r'^ + 1152 eV-* - 5184 e* + 2592 eV^ ~ 37 r"* - 3456 



roi,4(r, e) = 
^^i,5(r, e) = 



24 (6e + \/3)^ {6e-Vsfr^ 
9 1152 e^r^ + 192 - 192 eV^ ~ r"^ + 128 + 32 - 64 r + 36 

9 (r + 6)(r-2)^ 



with the corresponding intervals Ji = [l, ^^^^^ ^ , 12 = 



1+2 v^. 4_(W3^\ _ 



4 (i-^e) 4 (1+2 ^e) 
3 ' 3 



4 (1+2 ^e) 3 , ^ _ 



2(l-V3£) ' 



2 andTe = [2, 00). 



For e e 



J = 1, 3, 4, 5, 6 and 



(7\/3- 3^15) /12, V3/I2]), HA{r,e) = ^ajl?-, e) I(r- € Ij) where TO2j(r, e) = n7ij(r, e) for 



-3456eV + lllr"* - 5184 e + 4608 \/3 e V* - 336\/3er3 - 168r3 - 13824 e 



4^3 



+ 4608 V3 eV^ + 3456 eV^ + 144 - 6912 Vs eV^ - 3888 eV^ + 576 Vs e + 25920 eV 
+ 3168 + 2880 - 256 \/3 e - 32 - 3072 ^3 e^l / [36 {Vi + 6 e) V-6 e + \/3 



with the corresponding intervals Ii 



li = 



4 (1+2 V3e 3 

3 ' 2(1- 73 



1+2 V3£ 4 (1+2 V3£ 
l-y3£ ' 3 



,^5 = 



For e e 



^^3,2(?', e) 



Y^^,'^) andXe = [2,oo). 
V3/12, V3/3) , e) = Ej=i ^^3j(^ e) II?^ e 2:^ ) where 

2r2 - 1 



6 

432 e^'r^ + 1152 - 576 VSeV + 1296 e'^r^ - 960 VSeV^ + 864 eV'^ - 864 VSeV^ 

576 eV'"' - 192 %/3 e - 360 + 648 e + 64 \/3 e + 48 r'' + 192 VSe^ - 144 Vs e 

64 - 504 + 72 + 88 \/3 e - 25j / [16 (3 e - VS 



^^3,3(f, e) 



-54 + 36 V3 e r2 + 15 - 18 r2 + 2 V3 e + 20 



with the corresponding intervals Xi 



6 (-3e+x/3) r2 



1, i+2^% yx3 = 

' 2 (l-V3e) / ' 



1+2 73 e 



2 (l-V3e) ' 2(l-V3e) 



2 (l-\/3e) 



, CX) 



Appendix 3: Derivation of /i(r, e) 

We demonstrate the derivation of /isl?": e) for segregation with e e 0, a/3/8^ and among the intervals of r that 
do not vanish as e ^ 0. So the resultant expressions can be used in PAE analysis. 

First, observe that, by symmetry, 

^is{r,e)=P{X2 e 7VJ(Xi,e)) =6F(X2 G 7VJ(Xi,e),Xi e r,\nyi,e))- 

Let (;(yj, x) be the line parallel to and crossing T{y) such that (i(yj, q{yj,x)) ~ e for j = 1, 2, 3. Furthermore, 
let T, :=r(3^)\u3^ir(yj,e). Then g(yi, a;) = 2 e - ^3 a;, 9(y2,a;) = V3x~V3 + 2e, andg(y3,x) ==^3/2-6. 
Now, let 

Qi =g(yi,a;)nyTW= (2e/\/3,o) , Q2 = 9(y2, n ylW = (l-2e/\/3,o) , 

Q3 = 9(y2, 2:) n yiW = (1 - e/%/3, e) , ^ qiys, x) n yiW = (l/2 + e/Vs, V3/2 - e) , 

Q5 = g(y3,a;)nyTy:^= (l/2 - e/VS, %/3/2 - e) , Qe = g(yi, 2^) n ylyj = (e/V3, e) . 

See Figure m Then r(yi,e) = r(yi,0i,Q6), T(y2,e) - T(Q2,y2,Q3), and r(y3,e) = r(Q4,Q5,y3), and 
for e e [0, ■\/3/4), is the hexagon with vertices, Qj, j = 1,...,6. Now, let q2ix) be the hne such that 
rd{yi,q2{x)) = d(yi,^(Q2)) = d{yi,y^) - e, 93(0;) be the line such that r d(yi, 53(0;)) = d(yi,y^). Then 
92(3;) = —VSx + (v^ — 2 e) /r and 93(2;) is the same as £s{x) before. Let the x coordinate of q{yi,x) (1 iam {x) 
be si, 92(2;) n^am (x) be S3, and £s{x)n£am {x) be S5 and Qi = (s2,0), 172(2;) nyTy2 = (s4,0), and £s(a;)nyTy2 = 
(s6,0). So si V3e/2, S2 ^ 2e/V3, S3 = (3 - 2 e \/3) /(4 r), S4 = (3 - 2 e \/3) /(3 r), S5 = 3/(4r), and 
S6 = 1/r. 

See Figure 33] for an r e [2, V3/(2e)). Furthermore, for xi = {x,y) € Rcm (yi), let 

/7i := q{y2,x)r\£r{xi, x) = ((\/3y/6 + .t/2) r + 1/2 - e/^/3, (2//2 + ^/3 a;/2) r - V3/2 + e) , and 
U2 ■.^q{yi,x)fMr{xi,x) = ((y/VS + x) r + e/VS - 1/2, V3/2 - e) . 

Let ^{ai, a2, . . . , a„) denote the polygon with vertices ai, 02, . . . , a„. If xi is below q2{x), then A'J(xi, e) = 
A{N^{xi))\T{yi,e) = ^(Qi, TVi, 7V2, Qe), if a;i is between q2ix) and then A^j;(a;i,e) = ^(Qi,Q2,C/i, 

U2,Q5,Q6), and if xi is above isix), then 7VJ(a;i,e) = T^. 



Figure 43: The support under for e G (0, V3/4) and the two types £r{xi,x) for ?'i < r2. 




yi = (0, 



*iQi\=(.'i2,0) ^T^^ X« Ai:, = 0) 



Figure 44: The partition of Ts for different types of Ny{-, e) under ijf with r e [2, \/3/ (2 e)). 



For r e 
Then 



since e small enough that q2{x) n Ts = 0, then N{x, e) C for all a; G T^, \ T(yi, e). 

»l/2 ^(iVr(2.^^g)) 



S2 "'O 



..^X2 -dydx 

Isi Jq(yu'x) ^UcJ 

(576r2 - 1152)e4 + 288e2 - 37 
1296(2e- l)2(2e+ 1)2 ' 



where A(iV5(a;i, e)) = A(^(gi, TVi, iVa, Qe)) = ^ i a; y + j - ^ and A(TO = ^3/4 

•\/3e2 and tam{x) = xjyfi is the equation of the line segment \I\Mq. Hence for r e [1,3/2), /i(r) 

(576r^-1152)e''+288e^~37r^ 
216 (2 £-1)^(2 ■ 



For r e 



3/2,2-4e/\/3), ^(a;) crosses through Mi,Mc. Since e small enough so that q2(x) does the same. 



So below 92(2;) is equivalent to x\ G i?i(e) where 

i?i(e) = {(x,y) e [33,85] X [92(2;), 4m (2:)] U [32,83] X [92(2;), 4m (2:)] U [s3,l/2] X [0,92(2-)]}. 



Then 



JQ Jo 

A{^iQi,Ni,N2,Q6)) 



AiT^y 



"1/2 fi^i-) A{^^{Q^,N,,N2,Q,)) 



dydx+ / / ^^^^^^ dydx 



S2 "'O 



dydx 



(384r2 + 576 - 192r4)e'' + 512e3V3, 



(288 - 1728)6^ + (-576 VSr + 86AVtj e - 9r* - 324 + 288 r / 432 ((2 e - 1)^(2 e + ifr'^) 

where A{^{Qi, Ni, N2, Qe)) is same as before. 

Next, xi between q2{x) and is{x) is equivalent to xi G i?2(e) where 

i?2(e) = {(a;,y) e [59,52] x 4™ (x)] U [55, 1/2] x [g2(a;),4(x)]}. 

Then 

2 V3 e (10 e^V3+(32 r-24)e^ + (-27 V3+12 VSr) £-18r+27) 



P{X2 e A^J (Xi, e), Xr S i?2(6)) = /,;^ Q;!"-' -^-^v:.v..u.,u.,v..va;; ^^^^ 



1/2 r<32(a:) ^(^^(Qi.Qa.C/i.C/a.Qs.Qe)) 



dydx 



27 (4£^-l)V^ 



where 



yl(^(Qi,Q2,C/i,C/2,Q5,Q6)) = -V3e2+ (^2-2ry/V3-2rxj g + ^ y + ^3 r a- - V3/2 - VS y2/12 

- r^xy/2 - V3r^ x^/A. 



Furthermore, xi above £s{x) is equivalent to xi E i?3(e) where i?3(e) = {(a^, y) € [ssi 1/2] x [^am (a;), ^s(a;)]} . 
Then P{X2 e A^J (Xi, e), e R^ie)) = /^f //^^^^ 
Hence for r e [3/2,2), 



^1/2 ^e^^ix) 1 , , (2r-3)" 

ftif/OX — 6(2e--l)(2e+l)r^- 



^(^(r, e) = 6 ^(-384 + 384 + 192 r^)e* + (-768 ^3 + 512 ^3 r)e^ + (1728 - 2304 r + 864 r^)e'^ 
-288r2 - 324 + 9 + 576 r) / 432 ((2 e - 1)^(2 e + 1)^ r^) ) 
= - (-384 + 384+ 192 r4)e'^ + (-768 \/3 + 512 \/3r)e''^ + (1728- 2304 r + 864 r2)e2 
-288 - 324 + 9 r'* + 576 rj / [72 ((2 e + 1)^(2 e - 1)^^) . 



For r e [2,00), £six) crosses through yiM^, so the same types of Ny{xi,e) occur as above. The explicit 
forms of Rj{e), j = 1, 2, 3 change and are given below: 

i?i(e) = {ix,y) e [si,S2] X [9(yi,a;),4„ (x)] U [52,^3] x [0,4„ (x)] U [53,54] x [0, q2{x){x)]} 
i?2(e) = {ix,y) e [53,55] X [92(3;), 4m (x)] U [55,54] X [92(2;), , 4(2;)] U [54,55] x [0,4(a;)]} 
i?3(e) = {(2;,y) e [55,55] x [4(a;),4m (a;)] U [56,1/2] x [0,4m (a;)]}. 



Then 



S2 /-^a 



P(X2eiVJ(Xi,e),Xiei?i(e)) = 

/si Jq{yi,x) 
.S3 A(^(Qi,iVi,7V2,Q6)) 



A(^(Qi,iVi,iV2,g6)) 



dydx 



dydx 



-9 + (-32 + 16 + 16)e4 - 48 + 24 e V3 
36 (4e2- l)2r2 



A(r,)2 

p A(_^(gi,iVi,iV2,Q6)) 

S3 Jo 



dydx 



where A{^{Qi, Ni, N2, Qe)) is same as before. 
Furthermore, 

P [X2 e Ny(Xi,e),Xi e R2{e)) ^ / TFFyi '^V^^ 

Js3 Jq-lix) ^[r'-e) 
^ n /'^^"^ A{^{Q,,Q2,Ul,U2,Q5,Q6)) , , ^ n /-'^^"^ A{^{QuQ2Ml,U2,Q5,Q6)) , , 

2 V3 e (-27 e V3 - 24 £2 + 10 e3 V3 + 27) 
81(4e2- l)2r2 ' 

where A{B^{Qi, (32, ^^i, C^2, Q5, Qe)) is same as before. 
Next, 

P {X2 e N^iX,,e), X, e R,{e)) = Q J^-/^ Ah'^ydx + J^^- ^^2/^^ = eflT^^- 
Hence for r G [2, \/3/(2e) - 1), 

(48 - 32 - 96 r2)e4 + 64 e^^S + (72 - 144)e2 + 27 - 18 \ 



/is(r,e) = 6 



I 108(4e2- l)2r2 

(48 - 32 - 96 r2)e4 + 64 e^^s + (72 r2 - 144)e2 + 27-18 



18 (4e2- 1)2^.2 

For r = 00, it is trivial to see that fi{r) = 1. In fact, for fixed e > 0, ii{r) = 1 for r > VS/ (2 e). 



Appendix 4:Derivation of fis{r,e) and i's{r,e) for Segregation with e = 

V3/8 

For the segregation alternative with e = -\/3/8, (r, e = -v/S/S) = 1 for r > 4, so we find (^1 £ = for r g 

[1,4). In particular, for the mean we partition [1,4) into five intervals, [1,9/8), [9/8,3/2), [3/2,2), [2,3), [3,4), 
and for the covariance into twelve intervals, [1,12/11), [12/11,9/8), [9/8, a/6/11), [\/6/11, 21/16), [21/16,4/3), 
[4/3,3/2), [3/2, V3), [^3,7/4), [7/4,2), [2,3), [3,7/2), [7/2,4). We pick the sample intervals [3/2, 2) and [7/4,2) 
to demonstrate the calculations of the mean and the variance, respectively. Then observe that, by symmetry, 

fis{r,e)=P{X2 e N^iXi,e))^6PiX2 G A^J(Xi,e),Xi G i?e (yi)). 

Then g(yi,a;) = -\/3 (1/4 — x), q{)/2,x) = -y/S (4 a: — 3)/4, and q{y3,x) = 3\/3/8. See Section ?? for the definition 
of q{yj,x). Hence, Qi = q{yi,x) n ylyj = (1/4, 0), Q2 = q{y2, x) n ylyi" = (3/4, 0), Q3 = q{y2,x) n yiyi = 
(7/8,3/8), Q4 = g(y3,a^)ny^= (5/8,3a/3/8), Q5 = g(y3, a;) n ylyi = (3/8, 3 ^3/8) , QQ = q{yl,x)f^ 
yTyi= (1/8,3/8). Then r(yi,e) = r(yi,Qi,g6), T(y2,e) = T(g2, 2/2,^3), and T(y3,e) = TiQ^^Q^.y^i)., and 
for e = a/3/8, is the hexagon with vertices, Qj, j = 1, . . . , 6. 

Furthermore, q2{x) = — \/3(a; — 3/(4 r)) and qz{x) ~ V3(l/'' — a;) (i.e. the same as ls{x) before). See 
Section ?? for the definition of qj{x). Let the x coordinate of g(yi, x) n ^am {x) be si, q2{x){Mam [x) be 54, and 
£s{x) n tam {x) be sio and Qi = (s3,0), q2{x) n ylyi" = (se, 0), and ls{x) n yTy2 = (si2, 0). So si = 3/16 and 
S3 1/4, S4 9/(16r), se = 3/(4r) = sio, and S12 = 1/r. See Figure H51 

For xi = {x,y) G Rcai (yi), let 

C/i := (7(y2,a:) n£,.(xi,x) = (^\/3 (^4r y + 4 \/3 r x + 3 a/s) /24, r y/2 + \/3r x/2 - 3 \/3/8) , and 
U2 := '7(y3, x) n £r{xi,x) = (VS (8rj/ + 8\/3rx--3\/3) /24, 3 \/3/8) . 

If XI is below q2{x), then A^J (xi,e) N^{xi) \ T(yi, V3/8) = ^(Qi, iVi, TVa, Qe), if xi is between 92(2:) 
and is{x), then iV^y (a^i, e) = ^{Qi,Q2, Ui, U2, Q5, Qe), and if xi is above £six), then A^-y (xi, e) — T^. 



For r e [3/2,2), qsix) crosses through M^Mq and q2[x) crosses yiMg. So xi below q2{x) is equivalent to 
xi e Ri (\/3/8) where 



Ri (^V3/8j = {{x,y) e [si,S3] x [g(yi,a;),4m (2;)] U [53,34] x [0,4m (x)] U [s4, se] x [0,^2(2;)]}. 

Then 

/-^-t-) A(^(Qi,iVi,iV2,Q6)) /-'-'^"^ A(^(Qi,iVi,jV2,Q6)) ^ ^ 

dyda:+ / / ^7— ^ dydx 



J S3 Jo 



S4 v'O 



^(T,)2 ^ 676 r2 



where A(^(Qi,iVi,7V2,(36)) = 5^ (l2rx + 4\/3ry + 3) (l2 r x + 4 \/3 r y - 3) is same as before. 
Next, x\ between q2{x) and ts{x) is equivalent to xi G R2 (\/3/8) where 

i?2 (\/3/8) ^ {{X,y) G [S4,56] X [q2{x),^a„^ (x)] U [sg , 1/2] X [0,4(.t)]}. 



Then 



SB C^am {x) 



A{3^{Qi,Q2,Ui,U2,Q5,Q6)) 



P[X2eN^{X,,e),X,eR2{V3/8))=J^ j^^^^ y-^^^^^^-^y^^^^^^" dydx 

1/2 /-^^W ^(^(gi,Q2,C/i,C/2,Q5,Q6))^ ^ 64 r4 - 768 r3 + 1824 r2- 128 r- 1467 

dydx — 



S6 -^0 



A(re)2 " 2028 r2 

where ^(^(gi,Q2,t/i,C/2,g5,Q6)) = (-^2/' - 5 ^2/ - ^a^') + (| y + ^ x) r - if ^3. 

Furthermore, xi above 93(2;) is xi G i?3 (\/3/8) where i?3 (\/3/8) = G [sio,l/2] x [£s{x),£a7n i^)]} ■ 

Then 

ni/2 ^ 8(4r2-12r + 9) 



P(X2G A.i(X4,.),X4Gi?3(V3/8)) ^ WD'^y'^ 

Hence for r G [3/2,2), 

' 61 - 768 + 3494 - 5120 r + 2466 \ 61 - 768 + 3494 - 5120 r + 2466 



/ ^ / 6ir' - 768 r-^ + 3494 - 5120 r + 2466 \ 
MS (r, x/3/8j = 6 J 



338 r2 



For r > 4, it is trivial to see that /i(r) ~ 1. 

To find the covariance, we need to find the possible types of Fi {xi, Ny, e) and Ny (xi, e) for r G [1,4). The 
intersection points of £^j{r,x) with d{T{y)) and d{R{yj)) for j = 1,2,3, i.e. Gi — Gq and Li — ig sue same as 
before. Recall also Mc, Afi, M2, M3 and yi, y2, Vs. Then Fi (.ti, A^^^, e) is a polygon whose vertices are a subset 
of the above points. 

There are six cases regarding Fi (a;i,iV^y,e) and one case for Ny {xi,e). Each case j, corresponds to the 
region Rj (-\/3/8) in Figure [45] where q{yi,x), q2{x), qsix), sj for j = 1, 3, 4, 6, 10 are same as before and q4{x) = 
-y3(4x-r)/4, g5(x) = V3(2-r)/4ands2 = (r-l)/4, sg = (r^ - 2 r + 3)/(4r), S7 = 3r/16, S8 = (r--l)/2. 
(see Figure 145]). 

Then, for 

X, = {x,y) G i?i (^/3/8) ,Fi (xi,7Vj) = ^(Qi, G2, G3, G4, G5, Qe) 

XI G R2 (73/8) , Fi (xi, ivj;) = ^(gi, G2, G3, A/2, L4, L5, M3, ge) 




Figure 45: The regions corresponding to the seven cases for r G [1,4) with r — 1.9 

XI G i?3 iV3/8) , Ti (xi, iVj) = ^(Gi, G2, G3, G4, Gs, Ge) 
XI ei?4 (\/3/8),ri (a-i,iV-;;) =^(Gi,G2,G3,G4.G5,G6) 

The exphcit forms of Rj (-v/S/S) , j = 1, . . . , 4 are as foHows: 

Ri (V3/8^ = {{x,y) G [si, S2] X [^(yi, x), 4m (x)] U [s2, S7] x [95(2;), 4m (x)] U [s7, ss] X [g5(a;),g4(a;)]} 
i?2 (VS/Sj = {(a-, y) e [S2,S3] x [qiyi, x), q^ix)] U [s3,S8] x [0,q5{x)] U [53,59] x [0,g4(x)]} 
i?3(\/3/8) ={(x,y)e [s7,S8]x [(73(a;),4m(x)]U[s8,l/2] X [95(2;), 4m {x)]} 
i?4(V3/8) [53,59] x [<Z4(x), gsl^)] U [sg, 1/2] x [0,q5{x)]}. 

Let P2V(e) P ({X2,X3} C 7VJ(Xi, e)) , PJe(e) P({X2,X3} C rUXi,e)), andPX,(e) P(X2 G iV]; (Xi , e) , X3 € 

rU^i,e)). 

Now, by symmetry, P^j^ (^/3/8) = 6 F ({X2, X3} C (Xi, e) , Xi e T(yi, ^3/8)) . 
For r e [7/4,2), 

P [{X2, X3} C (Xi, e) , e T(yi, V3/8)) = A(r )3 ^^^"^ 

261 - 4608 + 29105 - 72960 + 32575 + 78848 r - 67620 
~ 65910 r2 ' 

where iV])(xi,e) = ^(Qi, iVi, Ar2, Qe) for xi G i?ia (%/3/8)ui?2a (\/3/8) , iVJ (xi, e) = ^(Qi, O2, C/i, C/2, Q5, Qe) 
for xi G (v^/8) U i?2b (v^/8) U i?3a (^/8) U E4, and (xi, e) = for xi G i?3h (V3/8), all of whose 
areas are given above. Hence for r G [7/4, 2), 

r.,<.r ^ .rv , .r 26 1 - 4608 + 29 1 05 - 72960 + 32575 + 78848 T - 67620 
P({X2,X3} C Ny iX,,e)) = . 

Next, by symmetry, FJ^ (x/3/8) = 6 P {{X^, X3} C F^ (Xi, e) , G T, \ r(yi, \/3/8)) , and 

4 

P ({X2, X3} C F^ (Xi, e) , Xi G \ T(yi, V3/8)) = ^ F ({^2, X3} C F^ (Xi, e) , Xi G Rj (\/3/8)) . 



For xi e Ri (\/3/8), 



P({X,,X,}cr[iX,,e), X,eR,(V3/8)) = / / ^ y.r/' dydx 



18894 - 12248 - 131375 + 45360 + 584030 - 841816 r + 337155 
^ 65910 r4 ■ 

where A (Fi (xi,7V5,e)) = — ^ jg2^:5 '-. 

For a-i e i?2 (\/3/8), 

P [{X2, Xs} c F^ (Xi, e) ,x,eR2 (Vs/s)) = J J ^ ^ VrO^ 

^4 ^m)^ '^''"+4 Jo AiT^r ^ 

4 (4865 ~ 24063 + 49460 - 73210 + 98045 - 84107 r + 29010) 

3295574 ■ 

< /-r^ / \\ \/3f96+32a V3a; + 128 V3ry-224V3y-192r+962:+93r2-48a;2 + 176'y^) 

where ^(Fi (a;i,7VJ,e)) = — ^ 

For XI e i?3 (\/3/8), 

P ({^2, X3} C F'l (Xi, e) , G i?3 (%/3/8^ 



46293 - 100944 - 254880 + 506880 + 829440 - 2064384 r + 1048576 

9886574 ■ 



where A (Fi {xi,N^,e)) = — !^ j^T^ ^■ 

For xi e i?4 (\/3/8), 

p({X2,X3}cF^(Xi,e), Xi ei?4(\/3/8)) 



8 (3577 - 20548 + 45620 - 61760 + 79040 - 77824 r + 32256) 

3295574 ■ 

where ^(Fi (xi,jVj),e)) = V^(-6r+3.-3.^+5,^^+3r^+3+4 V3.y-7 Vs.) _ 
So 

\ /19032r6- 243648r5 + 1118355r4 - 2085120r3 + 1534050r2- 113664r- 233639\ 
2g(^v/J- 197730 r4 j 

_ 19032 ~ 243648 + 1118355 7-4 - 2085120 r' + 1534050 - 113664r - 233639 
~ 3295574 ■ 

Furthermore, by symmetry, 

P'm (V3/8) = 6 P (x2 e A^^ (Xi, e) , X3 e F^ (Xi, e) , Xi G \ r(yi, ^3/8)) , 



and 

P (X2 e N'y (Xi, e) , X3 e (Xi, e) , G Ts \ T(yi, ■ 

i 

where j G {la, 16, 2a, 26,3a, 36,4}. The expUcit forms of these regions are 



Rla 


Vs/sj = \ 


{x,y) G 


[S1,S2] X 


[q(yi , a;) , (i)] U [s2 , S4] x [55 (i) , (.am {x)] U [s4 , S5] X [55 (a;) , 52 (2;)] } , 


Rib 


'y3/8j = \ 


(a;,?/) G 


[S4,S7] X 


[92(3::), -^am (x)] U [S7, S5] 


X [92(2;), 94(2;)] U [55,53] X [55(2;), 54(2;)]}, 


R2a 


'\/3/8) = \ 


{x,v) G 


[S2,S3] X 


[g(yi,a::),g5(x-)] U [53,55] 


X [0,175(2;)] U [55,56] X [0,92(2;)]}, 


R2b 


'^/s) = \ 


(a^.y) G 


[S5,S6] X 


[q2{x),qT,{x)] U [56,58] X 


[0,95(2:)] U [58,59] X [0,94(2:)]}, 


Raa 


'Vs/sJ = \ 


G 


[s7,Sio] X 


[qA{x),tam {x)] U [5l0,S 


g] X [94(2:),g3(2;)]U[s8,l/2] x [95(2;), 94(2:)]}, 


Rib 


'\/3/8) = \ 


{x,y) G 


[sio,l/2] 


X [q3{x)Ja^ {x)\}. 





i?4 (^3/8) is the same as before. 
For xi e Rla (V3/8), 



P (^2 G iVj (Xi,e) , X3 G (Xi,e) , Xi G (%/3/8)) = j^""^ 



A(7Vj;(2:i,6))A(ri (2;i,jV^,6)) 



+ r ^-^-^^(^^(^.^)M(r.(x.iv^,e))^^^^.^ 

isa ■'95(3:) ^(2^) Js4 



5 /■92 



A(jV^(xi,6))A(ri {xi,N^,e)) 

x) 



dydx 



= - [296O - 3045 r** - 21504 r"^ - 28554 r" + 101040 r'"^ + 205785 r* - 550080 + 391392 r 



dydx 



- 114048 r + 12150] / [39546O T 
For xi G (\/3/8), 

P(X2 G 7VJ (Xi,e), X-i G rn^i,e), ^1 G Rxb (^3/8)) = /' 



S7 ftarr, (x) 



A{N^{xi,e))A{ri {xi,N^,e)) 



+ / / aJtY^ y , , 



S4 Jq2(x) AiTt)''^ 

/"^-tW A(jV^(ai,6))A(ri (2:1, iV^, 6)) 



dydx 



dydx 



A{T,y^ 

= -|(r-^ - 3)(3111r-'" + 96r'' - 67659 r'' + 26528 r' + 341223 r" - 352896 - 177291 r* + 229632 r 



+ 23922 - 22464 r + 3078)] / [395460 r*^] . 



For xi G i?2a (v3/8) , 

P (X2 G N^y (X.,.) , X3 G (X.,e) , X. G P2. (^3/8)) ^ /^^ ^(iVH^. e)M (r. (x., X^ , e)) 



dydx 



+ 



'■^ A(jV^(xi,6))yl(ri(xi,iVj^,6)) 



Js5 ^0 



S2 ■'9(yi,a;) 

/-"^W A(Ar^(2;i,6))yl(ri(xi,7V^,6)) 



^(rO^* " Js, Jo Ainr 

_ 2 (r - 2) (r - 3) (530 + 1225 + 1697 - 3399 - 3027 + 5274 r - 1 188) 

98865 r5 

For xi G i?2b (\/3/8), 

P [X2 G iVj (Xi, e) , X3 G n (Xi, e) , Xi G P2b (V3/8j j = J J 



dydx 



A{N^{x,,e))Air, ixi,N^,e)) 



dydx 



^(jV^(xi,6))A(ri (2;i,JV^,6)) , r» /'^(-) A(JV^(2;i,6))A(ri (2;i,JV^,e)) 

io A{T.r """"^J., Jo AiTJ^ 



dydx 









-3)(467r^ - 


- 50r^ - 7789 




1290 










98865 r 



For xiei?3a (%/3/8), 

" rl /2 rQ':i( x) A / Tf^rr /" \\^/t-i/ i\.rr \\ 



sio J qi{x) 



= [3934r^^ - 11040 r^^ 



dx 

>dydx+ r r'^'-^(^^(----))f(^3^(---^^--))d,d. 

- 2352 r-^" - 283680 + 1239855 - 751008 r"^ - 3225344 
+ 6125568 r'^ - 3847680 + 81920 + 1843200 - 2045952 r + 905472] / ^395460 r" 
For xi e i?3b 



(^/3/8), 



,1/2 (X) ^(^r (^^^ ^ (xi, iVj;, e)) ^ _ 64(2r4-4r^ + 4r-3)(2r-3)=^ 
A.o ^(7^0^ 507 r6 

For xi e i?4 



,(^/8), 

P (Xa G iVj (Xi, e) , X3 G (Xi, e) , Xi 

/■sg /■q5(a:) /I Ar''. I r,. . A IT. 



• A{Nl,{x,,e))A{T^{x^,Nl„e)) f'/' AjN^ {x^, e)) A (F^ {x, 

A{T.r ^ 4 Jo A(T,f 

8(48r-'' - 22 r'"^ - 73 r^* - 3114 + 4391 + 1574 r - 2850)(r - 2f 

32955 r2 ■ 

Then 



sg J qi{x) 



■dydx 



Plj{e) = 6(^- 49 r^2_;L53g ^11^ ;l 7952 ,,10 _;^29280r3 + 609420 r^- 1728768?' 
-3013632 + 3418140 - 

49 r^^ - 1536 r" + 17952 r^" - 129280 + (309420 r 



2757670 

3829760 + 3026880 - 1571616 r + 445284 



395460 

,6 



- 129280 + 609420 - 1728768 + 2757670 r 
3013632 r-' + 3418140 r"* - 3829760 + 3026880 - 1571616 r + 445284] / [65910 



So 



E 



1^3/8 [/112/113] - 



49 r 



1536 r" + 18735 - 143104 + 677703 - 1704000 + 1737040 
- 691968 + 1681230 - 3716096 + 3260519 - 1571616 r + 445284] / [3295 



Hence 

- 637ri2_ 

-602490624r^ + 6735581 



19968 r" + 299370 - 3265792 + 24051519 - 112023360 + 328179640 

10 r'^- 427086848 + 133604087 r^- 20431008 r+ 5788692 / 428415 



Derivation of ^J,s{f: e) and i^s('') for segregation with e = \/3/4 and with e = 2 V3/7 arc similar. 



Appendix 5: Proof of Corollary 1 



In the muhiplc triangle case, 



Kj 



By definition of Ny{-), P{X2 G Ny{Xi)) = if Xi and X2 are in different triangles. So by the law of total 
probability 

J 

fi{r,J) := P{X2eN^iXi))=Y,P{X2eN^iXi)\{Xi,X2}cTj)P{{Xi,X2}cT,) 
J 

= ^mW-P({^i,^2} CT,) {since P{X2eN^{Xi)\{Xi,X2}cTj)^ti{r)) 

J 2 
= Kr) J2[^iT,)/AiCHiy))] (since P{{X^,X2} C T,) = (A(r,)M(C«(3^)))') 

Then /i(r, J) = /i(r) • ^X]j=i ""^f ) where /i(r) is given in Equation ([5]). 
Furthermore, the asymptotic variance is 

i/(r,J) = E[/ii2/ii3] -E[/ii2]E[/ii3] 

= p{{X2,X3} c iVj(Xi)) +2P(X2 e N^{Xi),X3 e ri(Xi,iVj)) 

+P({X2,X3} C T,{X,,N^)) - 4 (//(r, J))2. 

LetPJ^ := P({X2,X3} C iVJ(Xi)), ^({^2,^3} C ri(Xi, TVj;)), and P^, ^(^2 £ 7VJ(Xi),X3 G 

ri(Xi,7VJ)). Then for J > 1, we have 

,7 

P({^2,^3} C A^J(Xi)) = ;^P({X2,X3} C iVJ(Xi) I {Xi,X2,X3} C T,) P({Xi,X2,X3} C T,) 

,/ / J 



^pj^(A(r,)M(c«(3;)))' = P2V 

i=i \i=i 



,3 



Similarly, P{X2 € iVJ(Xi),X3 S ri(Xi,7VJ)) = P-, (^/^, and P{{X2,X,} C ri(Xi,7VJ)) = PJ^ (E/=i 
hence, 

Hr, J) = {P2N + 2PI1 + P2g) (J2 - 4 = ^(0 (^E + 4 (^E "'^ - (E ^ ' 

SO conditional on W, if z^(?', J) > then ^/n {pn{r, J) — fJ-ir, J)) A/'(0, i^(r, J)). ■ 



